Iterative set expansion using samples
US8589408B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 20, 2011 |
| Grant date | Nov 19, 2013 |
| Priority date | — |
| Expiry date | Jan 3, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/367
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A set expansion system is described herein that uses general-purpose web data to expand a set of seed entities. The system includes a simple yet effective quality metric to measure the expanded set, and includes two iterative thresholding processes to rank candidate entities. The system models web data sources and integrates relevance and coherence measurements to evaluate potential set candidates using an iterative process. The system uses general-purpose web data that is not specific to the given seeds. The system defines quality of the result set as the sum of two component scores: the relevance of a set of entities that measures their similarity with the given seeds, and the coherence of the set of entities produced which is how closely the entities in the set are related to each other. Based on this quality measure, the system develops a class of iterative set expansion processes.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.