Patent · US Expired

Methods for iteratively and interactively performing collection selection in full text searches

US6018733A · kind A · utility

149Cited by
8References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 12, 1997
Grant dateJan 25, 2000
Priority date
Expiry dateSep 12, 2017

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99943
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method of selecting the likely most relevant database collections for document searching based on an ad hoc query where each of the databases includes a plurality of documents. Iterative collection selection processing of the databases is performed to obtain consistent relative-ranking collection selection results for each iteration. The method uses a collection selection query and performs the repetitive steps of determining an inverse collection frequency and a document frequency for each database; determining a ranking value for each database; selecting a subset of the set of databases based on predetermined criteria dependant on the ranking value for each the database. The method provides for automated and manual descriptions, boolean selection terms combined with soft terms, and uses term proximity, capitalization, phraseology and other information in establishing a relevance ranking of the collections with respect to the ad hoc query.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.