Multicriteria record linkage with surrogate blocking keys
US11520764B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Jun 27, 2019 |
| Grant date | Dec 6, 2022 |
| Priority date | — |
| Expiry date | Jan 28, 2040 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/2272
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A computer-implemented method and a related system for record linkage of an incoming record to a reference data set may be provided. The method comprises providing a reference data set comprising a plurality of records, each record comprising a plurality of attributes. The method further comprises assigning each of the plurality of records an initial surrogate identifier value; assigning a plurality of block identifiers to each of the records by applying a locality-sensitive hashing function to a predefined attribute of the records, which yields the plurality of block identifiers; and determining a final surrogate identifier value for each of the records assigned to one of the blocks such that the final surrogate identifier values in each block are uniformly distributed.
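The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say which locality-sensitive hash is used or how uniformity is achieved, so everything below is an assumption made for illustration: MinHash over character 3-grams with banding as the hash, the list index as the initial surrogate identifier, a 32-bit `KEY_SPACE`, one final value per (block, record) pair placed at evenly spaced positions, and all function names (`shingles`, `seeded_hash`, `block_ids`, `build_blocks`, `spread_uniformly`).

```python
import hashlib
from collections import defaultdict

KEY_SPACE = 2**32  # assumed size of the surrogate key range (not from the patent)


def shingles(text, k=3):
    """Return the set of character k-grams of a normalized string."""
    s = text.lower().strip()
    return {s[i:i + k] for i in range(max(1, len(s) - k + 1))}


def seeded_hash(seed, token):
    """Deterministic 64-bit hash of a token under a given seed."""
    data = f"{seed}:{token}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def block_ids(text, num_hashes=8, rows_per_band=2):
    """Assumed LSH: MinHash signature split into bands, one block id per band."""
    tokens = shingles(text)
    signature = [min(seeded_hash(seed, t) for t in tokens)
                 for seed in range(num_hashes)]
    ids = []
    for start in range(0, num_hashes, rows_per_band):
        band = signature[start:start + rows_per_band]
        ids.append(f"{start // rows_per_band}:"
                   + "-".join(format(v, "x") for v in band))
    return ids


def build_blocks(records, attribute):
    """Map each block id to the initial surrogate ids of its member records."""
    blocks = defaultdict(list)
    for initial_id, record in enumerate(records):  # list index = initial id
        for b in block_ids(record[attribute]):
            blocks[b].append(initial_id)
    return blocks


def spread_uniformly(blocks, key_space=KEY_SPACE):
    """Give each (block, record) pair an evenly spaced final surrogate value."""
    final = {}
    for b, members in blocks.items():
        step = key_space // len(members)
        for rank, initial_id in enumerate(members):
            # Centre each record in its own equal-width slice of the key space.
            final[(b, initial_id)] = rank * step + step // 2
    return final


records = [{"name": "Jon Smith"}, {"name": "John Smith"}, {"name": "Alice Wong"}]
blocks = build_blocks(records, "name")
final_ids = spread_uniformly(blocks)
```

In this sketch an incoming record would be hashed with the same `block_ids` function and compared only against the records that share one of its blocks. Even spacing is one simple way to make the values in a block uniformly distributed; a randomized assignment would be another.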
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.