Managing provenance information for data processing pipelines
US12430068B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 29, 2019 |
| Grant date | Sep 30, 2025 |
| Priority date | — |
| Expiry date | Jun 9, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F11/3442
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for managing provenance information associated to one or more interconnected provenance entities in a provenance system for data processing pipelines in a distributed cloud environment over a network interface, wherein each of the data processing pipelines is configured to read in data, transform the data, and output transformed data is disclosed. The method comprises steps being performed by a configuration component of obtaining at least one declarative intent representing a configuration indicative of requirements and levels of priority for storage of provenance information for each of the data processing pipelines, deriving the requirements and levels of priority for storage of provenance information for each of the data processing pipelines based on the obtained at least one declarative intent, wherein one of the levels of priority—first level of priority—is higher than the other levels of priority—second levels of priority, estimating storage capacity for storage of provenance information in the provenance system based on the derived requirements and levels of priority, storing the provenance information according to the derived requirements and levels of priority fo…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.