Method and apparatus for content identification and categorization of textual data
US6363174B1 · kind B1 · utility
44Cited by
5References
24Claims
0Family size
Assignees
Inventor
Key dates
| Filing date | Dec 28, 1998 |
| Grant date | Mar 26, 2002 |
| Priority date | — |
| Expiry date | Dec 28, 2018 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99936
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and an apparatus for content identification and categorization of textual data is disclosed. Using the Burrows-Wheeler transform in conjunction with mapping techniques and statistical comparison, useful information can be extracted from textual documents. This information can be used to categorize, authenticate, and compare such documents, thereby leading to automated searching of databases of documents.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.