Patent · US Expired

Word importance calculation method, document retrieving interface, word dictionary making method

US6850937B1 · kind B1 · utility

17Cited by
16References
8Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 22, 2000
Grant dateFeb 1, 2005
Priority date
Expiry dateNov 5, 2022

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99945
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A known method for selecting words (or word sequences), which is an important aspect of information retrieval, involves the problems of inability to eliminate high-frequency common words and of often arbitrary setting of the threshold value for dividing important and unimportant words. These problems are solved by normalizing the difference between the word distribution in a subset of all documents containing a word to be extracted (or a subset of said document set) and the word distribution in the set of all documents with the number of words in the said subset of all documents containing the word as a parameter, and the accuracy of support information retrieval is thereby enhanced.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.