Normalizing ingested data sets based on fuzzy comparisons to known data sets
US9529863B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 21, 2015 |
| Grant date | Dec 27, 2016 |
| Priority date | — |
| Expiry date | Dec 21, 2035 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/254
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embodiments are directed towards normalizing ingested data sets based on fuzzy comparisons to known data sets. Raw data sets that each include raw records may be provided to an ingestion engine. Ingestion rules and known data sets may be provided based on the raw records. The ingestion engine may be employed to iteratively execute the ingestion rules. A comparison of the raw records to the known data sets may be performed. Contents of the raw records may be transformed into model record values and stored in model records. A score value that indicates a confidence level that the model records are correct may be provided. An association of the one or more ingestion rules used to transform the raw record contents into the model record values for each of the one or more model records may be added to a data model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.