Patent · US Active

Managing provenance information for data processing pipelines

US12430068B2 · kind B2 · utility

0Cited by
0References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 29, 2019
Grant dateSep 30, 2025
Priority date
Expiry dateJun 9, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F11/3442
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for managing provenance information associated to one or more interconnected provenance entities in a provenance system for data processing pipelines in a distributed cloud environment over a network interface, wherein each of the data processing pipelines is configured to read in data, transform the data, and output transformed data is disclosed. The method comprises steps being performed by a configuration component of obtaining at least one declarative intent representing a configuration indicative of requirements and levels of priority for storage of provenance information for each of the data processing pipelines, deriving the requirements and levels of priority for storage of provenance information for each of the data processing pipelines based on the obtained at least one declarative intent, wherein one of the levels of priority—first level of priority—is higher than the other levels of priority—second levels of priority, estimating storage capacity for storage of provenance information in the provenance system based on the derived requirements and levels of priority, storing the provenance information according to the derived requirements and levels of priority fo…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.