Patent · US Active

Classifying an unmanaged dataset

US10592481B2 · kind B2 · utility

5Cited by
0References
14Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 6, 2017
Grant dateMar 17, 2020
Priority date
Expiry dateApr 20, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/285
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calculating, by a data classifier application of the computer system, a first similarity score between the source dataset and a first reference table of the reference tables based on common attributes in the source dataset and a join of the first reference table with at least one further reference table of the reference tables having a relationship with the first reference table. The method may further include classifying, by the data classifier application, the source dataset by determining using at least the calculated first similarity score whether the source dataset is organized as the first reference table in accordance to the reference storage model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.