Systems and methods for automated data quality semantic constraint identification using rich data type inferences
US12105687B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 27, 2022 |
| Grant date | Oct 1, 2024 |
| Priority date | — |
| Expiry date | Oct 27, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/2365
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and methods for automated data quality semantic constraint identification using rich data type inferences are disclosed. In one embodiment, a method for automated data quality analysis may include: (1) receiving, by a data quality engine computer program, reference data from a data source, wherein the reference data comprises a plurality of columns; (2) inferring, by the data quality engine computer program, a rich data type for each of the plurality of columns, wherein the rich data type has a specific format, a content constraint, and/or a specific application; (3) applying, by the data quality engine computer program, a data quality constraint to each column based on the rich data type for the column; (4) updating, by the data quality engine computer program, the reference data with production data; and (5) identifying, by the data quality engine computer program, a data quality issue in the production data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.