Patent · US Active

Normalizing ingested data sets based on fuzzy comparisons to known data sets

US9529863B1 · kind B1 · utility

41Cited by
67References
28Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 21, 2015
Grant dateDec 27, 2016
Priority date
Expiry dateDec 21, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/254
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments are directed towards normalizing ingested data sets based on fuzzy comparisons to known data sets. Raw data sets that each include raw records may be provided to an ingestion engine. Ingestion rules and known data sets may be provided based on the raw records. The ingestion engine may be employed to iteratively execute the ingestion rules. A comparison of the raw records to the known data sets may be performed. Contents of the raw records may be transformed into model record values and stored in model records. A score value that indicates a confidence level that the model records are correct may be provided. An association of the one or more ingestion rules used to transform the raw record contents into the model record values for each of the one or more model records may be added to a data model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.