Methods for iteratively and interactively performing collection selection in full text searches
US6018733A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Sep 12, 1997 |
| Grant date | Jan 25, 2000 |
| Priority date | — |
| Expiry date | Sep 12, 2017 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99943
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method of selecting the likely most relevant database collections for document searching based on an ad hoc query where each of the databases includes a plurality of documents. Iterative collection selection processing of the databases is performed to obtain consistent relative-ranking collection selection results for each iteration. The method uses a collection selection query and performs the repetitive steps of determining an inverse collection frequency and a document frequency for each database; determining a ranking value for each database; selecting a subset of the set of databases based on predetermined criteria dependant on the ranking value for each the database. The method provides for automated and manual descriptions, boolean selection terms combined with soft terms, and uses term proximity, capitalization, phraseology and other information in establishing a relevance ranking of the collections with respect to the ad hoc query.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.