Curating ambiguous data for use in a data pipeline through interaction with a data source
US12182098B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 29, 2023 |
| Grant date | Dec 31, 2024 |
| Priority date | — |
| Expiry date | Jun 29, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/215
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods and systems for curating data by a data manager are disclosed. Data may be curated from various data sources before being provided to downstream consumers that may rely on the trustworthiness of the curated data in order to provide desired computer-implemented services. During the data curation process, data curation resources are used to improve the trustworthiness and/or value of the collected data. However, data curation resources (e.g., data curators, computing resources) may be limited and/or insufficient to perform the data curation process as desired, which may result in unusable and/or uncurated (e.g., untrustworthy) data. Thus, the data may be screened for ambiguous values. A potential replacement value for each ambiguous value may be provided to the data source and the data source may indicate whether the potential replacement value should be used in the data pipeline as a final replacement value for the ambiguous value.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.