Method and system for linking heterogeneous data sources
US10235633B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 19, 2014 |
| Grant date | Mar 19, 2019 |
| Priority date | — |
| Expiry date | Sep 9, 2037 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for linking records (related to an entity) from separate databases may include extracting a first record from a first database as a first vector, extracting a second record from a second database as a second vector, generating first and second sub-vectors for the first and second vectors, where each sub-vector includes quality features from the respective vector, pre-processing the first and second sub-vectors using domain knowledge, calculating a distance assessment classifier based on the first and second sub-vectors, and determining whether the distance represented by the distance assessment classifier is greater than a threshold. If the distance is greater than the threshold, the records may be linked; if not, the method extracts additional records and repeats after generating first and second sub-vectors until the distance is greater than the threshold. A system for linking records is also disclosed.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.