Spectral neighborhood blocking for entity resolution
US8719267B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 19, 2010 |
| Grant date | May 6, 2014 |
| Priority date | — |
| Expiry date | Feb 5, 2031 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F18/231
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A processing device of an information processing system is operative to obtain a plurality of records, documents, web pages or other data objects, and to construct a binary tree using a bipartition procedure in which subsets of the data objects are associated with respective nodes of the tree. Evaluation of a designated modularity for a given one of the nodes of the tree is used as a stopping criterion to prevent further partitioning of that node and to indicate designation of that node as a leaf node of the tree. The resulting leaf nodes of the tree provide a non-overlapping partitioning of the plurality of data objects. The processing device is further operative to perform a neighborhood search on the tree to identify pairs of the plurality of data objects that match the same entity, and to store an indication of the matching pairs of data objects.
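The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not say which modularity measure or bipartition procedure is used, so this sketch makes several assumptions. It splits each node by the sign of the leading eigenvector of the modularity matrix of that node's induced similarity subgraph (a Newman-style spectral bipartition). It stops when the split yields no positive modularity gain. It treats the "neighborhood" of a data object as the leaf block that contains it. The similarity matrix `W`, the helper names `bipartition`, `build_leaves` and `matching_pairs`, and the `threshold` parameter are all hypothetical. For brevity the sketch returns only the leaf blocks rather than materializing the binary tree.

```python
import itertools
import numpy as np


def bipartition(W, idx):
    """Split the objects in idx by the sign of the leading eigenvector of the
    modularity matrix of the induced subgraph. Returns (left, right, gain)."""
    A = W[np.ix_(idx, idx)]
    k = A.sum(axis=1)          # weighted degree inside this node
    m2 = k.sum()               # twice the total edge weight inside this node
    if len(idx) < 2 or m2 == 0:
        return list(idx), [], 0.0
    B = A - np.outer(k, k) / m2            # modularity matrix (assumed form)
    _, vecs = np.linalg.eigh(B)            # eigenvalues in ascending order
    s = np.where(vecs[:, -1] >= 0, 1.0, -1.0)
    gain = float(s @ B @ s) / (2.0 * m2)   # modularity of the two-way split
    left = [i for i, side in zip(idx, s) if side > 0]
    right = [i for i, side in zip(idx, s) if side < 0]
    return left, right, gain


def build_leaves(W, idx=None, eps=1e-9):
    """Recursively bipartition; a node becomes a leaf when the split gives no
    positive modularity gain (the assumed stopping criterion)."""
    if idx is None:
        idx = list(range(W.shape[0]))
    left, right, gain = bipartition(W, idx)
    if gain <= eps or not left or not right:
        return [list(idx)]
    return build_leaves(W, left, eps) + build_leaves(W, right, eps)


def matching_pairs(W, leaves, threshold=0.5):
    """Compare only pairs that share a leaf (the assumed neighborhood) and
    keep those whose similarity reaches the hypothetical threshold."""
    return {
        (i, j)
        for leaf in leaves
        for i, j in itertools.combinations(sorted(leaf), 2)
        if W[i, j] >= threshold
    }


# Toy data: objects 0-2 resemble each other, as do objects 3-5.
W = np.full((6, 6), 0.05)
W[:3, :3] = 0.9
W[3:, 3:] = 0.9
np.fill_diagonal(W, 0.0)

leaves = build_leaves(W)            # non-overlapping blocks of object indices
pairs = matching_pairs(W, leaves)   # stored indication of matching pairs
```

Because the leaves do not overlap, the pairwise comparison runs only inside each block. This is what makes blocking cheaper than comparing every pair of data objects.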
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.