Patent · US Active

Managing data profiling operations related to data type

US9971798B2 · kind B2 · utility

2Cited by
39References
55Claims
0Family size

Assignee

Inventor

Key dates

Filing dateFeb 19, 2015
Grant dateMay 15, 2018
Priority date
Expiry dateFeb 24, 2036

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/24
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Processing data in a computing system includes receiving a plurality of records that each have one or more values for respective fields of a plurality of fields. Data type information associates each of one or more data types with at least one identifier. Processing a plurality of data values from the records includes: generating a plurality of data units from the records, each data unit including a field identifier that uniquely identifies one of the fields and a binary value from one of the records, the binary value extracted from the field of that record identified by the field identifier; aggregating information about binary values from a plurality of the data units; generating a list of entries for each of one or more of the fields, at least some of the entries each including one of the binary values and information about that binary value aggregated from a plurality of the data units; retrieving a data type associated with a first identifier from the data type information, and associating the retrieved data type with at least one binary value included in an entry of one of the lists; and generating profile information for at least one of the fields based at least in part on a…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.