Patent · US Active

Systems and methods for automated data quality semantic constraint identification using rich data type inferences

US12105687B2 · kind B2 · utility

0Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 27, 2022
Grant dateOct 1, 2024
Priority date
Expiry dateOct 27, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/2365
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods for automated data quality semantic constraint identification using rich data type inferences are disclosed. In one embodiment, a method for automated data quality analysis may include: (1) receiving, by a data quality engine computer program, reference data from a data source, wherein the reference data comprises a plurality of columns; (2) inferring, by the data quality engine computer program, a rich data type for each of the plurality of columns, wherein the rich data type has a specific format, a content constraint, and/or a specific application; (3) applying, by the data quality engine computer program, a data quality constraint to each column based on the rich data type for the column; (4) updating, by the data quality engine computer program, the reference data with production data; and (5) identifying, by the data quality engine computer program, a data quality issue in the production data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.