Selecting candidate rows for deduplication
US9547664B2 · kind B2 · utility
4Cited by
6References
18Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | May 1, 2014 |
| Grant date | Jan 17, 2017 |
| Priority date | — |
| Expiry date | May 1, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/1727
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present invention extends to methods, systems, and computer program products for selecting candidate records for deduplication from a table. A table can be processed to compute an inverse index for each field of the table. A deduplication algorithm can traverse the inverse indices in accordance with a flexible user-defined policy to identify candidate records for deduplication. Both exact matches and approximate matches can be found.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.