Patent · US Active

Data classification

US11748382B2 · kind B2 · utility

3Cited by
5References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 18, 2020
Grant dateSep 5, 2023
Priority date
Expiry dateDec 17, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method provides for classifying data fields of a dataset. A classifier configured for determining confidence values for a plurality of data classes for the data fields may be applied. Using the confidence values, data class candidates may be identified. Data fields may be determined for which a plurality of data class candidates is identifiable. Using previous user-selected data class assignments, a probability may be determined for the data class candidates that the respective data class candidate is a data class to which the respective data field is to be assigned. The data fields may be classified using the probabilities to select for the data fields a data class from the data class candidates. The dataset may be provided with metadata identifying for the data fields the data classes to which the respective data fields are assigned.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.