Word importance calculation method, document retrieving interface, word dictionary making method
US6850937B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 22, 2000 |
| Grant date | Feb 1, 2005 |
| Priority date | — |
| Expiry date | Nov 5, 2022 |
Classification
- Technology area (CPC Y): Emerging Cross-Sectional Technologies
- CPC primary: Y10S707/99945
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Selecting important words (or word sequences) is a key part of information retrieval, but known methods have two problems: they cannot eliminate high-frequency common words, and the threshold value separating important from unimportant words is often set arbitrarily. These problems are solved by taking the difference between the word distribution in the subset of all documents that contain the word to be extracted (or in a subset of that document set) and the word distribution in the set of all documents, and normalizing that difference with the number of words in the subset as a parameter. The accuracy of information retrieval support is thereby enhanced.
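The abstract names neither the distance measure nor the normalizing function, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes Kullback-Leibler divergence as the "difference" between word distributions, and it assumes the normalizer is the average divergence of randomly drawn document subsets with at least as many words as the subset containing the target word. The function names (`word_counts`, `distance`, `baseline`, `importance`) and the toy corpus `DOCS` are hypothetical and do not come from the patent.

```python
import math
import random
from collections import Counter

# Toy corpus (hypothetical): each document is a list of tokens.
DOCS = [s.split() for s in [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the stock market fell on the news",
    "the stock price rose on the report",
    "the cat chased the dog",
    "the market price of the stock",
]]

def word_counts(docs):
    """Count every token across a list of tokenized documents."""
    counts = Counter()
    for doc in docs:
        counts.update(doc)
    return counts

def distance(sub_counts, all_counts):
    """Assumed measure: KL divergence of the subset's word distribution
    from the whole collection's word distribution."""
    n_sub = sum(sub_counts.values())
    n_all = sum(all_counts.values())
    total = 0.0
    for word, count in sub_counts.items():
        p_sub = count / n_sub
        p_all = all_counts[word] / n_all  # > 0 because the subset is drawn from docs
        total += p_sub * math.log(p_sub / p_all)
    return total

def baseline(docs, all_counts, n_words, trials=200, seed=0):
    """Assumed normalizer: mean distance of random document subsets that
    hold at least n_words words (or every document, if that is fewer)."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    total = 0.0
    for _ in range(trials):
        order = docs[:]
        rng.shuffle(order)
        sample, size = [], 0
        for doc in order:
            if size >= n_words:
                break
            sample.append(doc)
            size += len(doc)
        total += distance(word_counts(sample), all_counts)
    return total / trials

def importance(word, docs):
    """Raw distance of the documents containing `word`, divided by the
    baseline for a subset of the same word count."""
    all_counts = word_counts(docs)
    subset = [doc for doc in docs if word in doc]
    if not subset:
        return 0.0
    n_words = sum(len(doc) for doc in subset)
    raw = distance(word_counts(subset), all_counts)
    base = baseline(docs, all_counts, n_words)
    return raw / base if base > 0 else 0.0

if __name__ == "__main__":
    for w in ("the", "stock", "cat"):
        print(w, importance(w, DOCS))
```

In this toy corpus "the" occurs in every document, so its subset is the whole collection and its score is zero, while a topical word such as "stock" selects a subset whose distribution differs from the collection and scores above zero. Because the score is a ratio to what a random subset of comparable size would produce, a cutoff can be stated relative to that baseline instead of being chosen separately for each subset size.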
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.