Efficient retrieval algorithm by query term discrimination
US7925644B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 27, 2008 |
| Grant date | Apr 12, 2011 |
| Priority date | — |
| Expiry date | Jun 10, 2029 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/334
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and system for use in information retrieval includes, for each of a plurality of terms, selecting a predetermined number of top scoring documents for the term to form a corresponding document set for the term. When a plurality of terms are received, optionally as a query, the system ranks, using an inverse document frequency algorithm, the plurality of terms for importance based on the document sets for the plurality of terms. Then a number of ranked terms are selected based on importance and a union set is formed based on the document sets associated with the selected number of ranked terms.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.