Method and apparatus for highlighting and categorizing documents using coded word tokens
US5526443A · kind A · utility
Assignees
Inventor
Key dates
| Filing date | Nov 9, 1995 |
| Grant date | Jun 11, 1996 |
| Priority date | — |
| Expiry date | Nov 9, 2015 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/313
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Highlighting and categorization of documents is carried out by using word tokens which represent words appearing in a document. Elimination of certain unimportant word tokens is first completed, after which the remaining words of the document are ranked according to their word token appearance rates. These rates are then used to highlight frequently appearing words in the document which indicate the document's topic. The document can also be categorized using document profiles developed from the word tokens.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.