Method for classifying an unmanaged dataset
US10055430B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 14, 2015 |
| Grant date | Aug 21, 2018 |
| Priority date | — |
| Expiry date | Oct 19, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/285
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calculating, by a data classifier application of the computer system, a first similarity score between the source dataset and a first reference table of the reference tables based on common attributes in the source dataset and a join of the first reference table with at least one further reference table of the reference tables having a relationship with the first reference table. The method may further include classifying, by the data classifier application, the source dataset by determining using at least the calculated first similarity score whether the source dataset is organized as the first reference table in accordance to the reference storage model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.