Managing data profiling operations related to data type
US9971798B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Feb 19, 2015 |
| Grant date | May 15, 2018 |
| Priority date | — |
| Expiry date | Feb 24, 2036 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/24
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Processing data in a computing system includes receiving a plurality of records that each have one or more values for respective fields of a plurality of fields. Data type information associates each of one or more data types with at least one identifier. Processing a plurality of data values from the records includes: generating a plurality of data units from the records, each data unit including a field identifier that uniquely identifies one of the fields and a binary value from one of the records, the binary value extracted from the field of that record identified by the field identifier; aggregating information about binary values from a plurality of the data units; generating a list of entries for each of one or more of the fields, at least some of the entries each including one of the binary values and information about that binary value aggregated from a plurality of the data units; retrieving a data type associated with a first identifier from the data type information, and associating the retrieved data type with at least one binary value included in an entry of one of the lists; and generating profile information for at least one of the fields based at least in part on a…
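The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The names `DATA_TYPES`, `FIELD_TYPE_IDS`, `make_data_units`, `build_census`, and `profile` are hypothetical, as are the two example data types and the choice of profile statistics. The sketch assumes each record is a tuple of raw byte strings, one per field, and that the field's position serves as its field identifier.

```python
from collections import Counter, defaultdict

# Hypothetical data type information: identifier -> decoder for that data type.
DATA_TYPES = {
    "int32_be": lambda b: int.from_bytes(b, "big", signed=True),
    "utf8": lambda b: b.decode("utf-8"),
}

# Hypothetical mapping from field identifier to data type identifier.
FIELD_TYPE_IDS = {0: "int32_be", 1: "utf8"}


def make_data_units(records):
    """Yield one (field_id, binary_value) data unit per field value."""
    for record in records:
        for field_id, raw in enumerate(record):
            yield field_id, raw


def build_census(data_units):
    """Aggregate occurrence counts of each distinct binary value per field."""
    counts = defaultdict(Counter)
    for field_id, raw in data_units:
        counts[field_id][raw] += 1
    # One list of (binary_value, count) entries for each field.
    return {fid: list(c.items()) for fid, c in counts.items()}


def profile(census):
    """Look up each field's data type, decode its entries, and summarize."""
    result = {}
    for field_id, entries in census.items():
        decode = DATA_TYPES[FIELD_TYPE_IDS[field_id]]
        # The type is applied once per distinct value, after aggregation.
        typed = [(decode(raw), n) for raw, n in entries]
        values = [v for v, _ in typed]
        result[field_id] = {
            "distinct": len(typed),
            "total": sum(n for _, n in typed),
            "min": min(values),
            "max": max(values),
            "most_common": max(typed, key=lambda e: e[1])[0],
        }
    return result


records = [
    ((7).to_bytes(4, "big", signed=True), b"alice"),
    ((-2).to_bytes(4, "big", signed=True), b"bob"),
    ((7).to_bytes(4, "big", signed=True), b"alice"),
]
profiles = profile(build_census(make_data_units(records)))
```

In this sketch, aggregation works only on raw binary values, and the data type is retrieved and applied once per distinct value rather than once per record. This is one plausible reading of why the abstract associates the data type with entries of the aggregated lists.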
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.