Selecting candidate rows for deduplication
US8719236B2 · kind B2 · utility
1Cited by
4References
20Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Aug 23, 2012 |
| Grant date | May 6, 2014 |
| Priority date | — |
| Expiry date | Aug 23, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/1727
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present invention extends to methods, systems, and computer program products for selecting candidate records for deduplication from a table. A table can be processed to compute an inverse index for each field of the table. A deduplication algorithm can traverse the inverse indices in accordance with a flexible user-defined policy to identify candidate records for deduplication. Both exact matches and approximate matches can be found.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.