High precision set expansion for large concepts
US9547718B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 14, 2011 |
| Grant date | Jan 17, 2017 |
| Priority date | — |
| Expiry date | Apr 24, 2032 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06Q30/0201
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A set expansion system is described herein that improves precision, recall, and performance of prior set expansion methods for large sets of data. The system maintains high precision and recall by 1) identifying the quality of particular lists and applying that quality through a weight, 2) allowing for the specification of negative examples in a set of seeds to reduce the introduction of bad entities into the set, and 3) applying a cutoff to eliminate lists that include a low number of positive matches. The system may perform multiple passes to first generate a good candidate result set and then refine the set to find a set with highest quality. The system may also apply Map Reduce or other distributed processing techniques to allow calculation in parallel. Thus, the system efficiently expands large concept sets from a potentially small set of initial seeds from readily available web data.
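The three mechanisms named in the abstract (a per-list quality weight, negative seeds, and a positive-match cutoff) can be illustrated with a minimal Python sketch. This is not the patented method: the function name `expand_set`, the parameters `min_positive`, `passes`, and `top_k`, and the weight formula `(positive hits - negative hits) / list size` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
from collections import defaultdict


def expand_set(lists, positive_seeds, negative_seeds=(),
               min_positive=2, passes=2, top_k=10):
    """Illustrative set expansion over candidate lists (hypothetical sketch).

    lists: iterable of lists of entity strings (e.g. lists scraped from web pages).
    Returns entities ranked by accumulated list weight, best first.
    """
    seeds = set(positive_seeds)
    positives = set(seeds)
    negatives = set(negative_seeds)
    ranked = []
    for _ in range(passes):
        scores = defaultdict(float)
        for raw in lists:
            items = set(raw)
            if not items:
                continue
            pos_hits = len(items & positives)
            neg_hits = len(items & negatives)
            # Cutoff: ignore lists with too few positive matches.
            if pos_hits < min_positive:
                continue
            # Assumed quality weight: reward positive matches, penalize negatives.
            weight = (pos_hits - neg_hits) / len(items)
            if weight <= 0:
                continue
            # Every non-negative member of a surviving list earns its weight.
            for entity in items - negatives:
                scores[entity] += weight
        # Sort by descending score; break ties alphabetically for determinism.
        ranked = sorted(scores, key=lambda e: (-scores[e], e))
        # Refinement: the next pass treats the top candidates as positives.
        positives = seeds | set(ranked[:top_k])
    return ranked


if __name__ == "__main__":
    candidate_lists = [
        ["red", "green", "blue", "yellow"],
        ["red", "blue", "apple", "orange"],
        ["red", "green", "purple"],
        ["apple", "orange", "banana", "green"],
    ]
    print(expand_set(candidate_lists, ["red", "green"], negative_seeds=["apple"]))
```

Because each list is scored independently and the per-entity scores are simply summed, the inner loop maps naturally onto a map step (emit entity and weight per list) and a reduce step (sum per entity), which is one plausible reading of the distributed processing the abstract mentions.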
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.