Patent · US Expired

Method and apparatus for content identification and categorization of textual data

US6363174B1 · kind B1 · utility

Cited by: 44
References: 5
Claims: 24
Family size: 0

Assignees

Inventor

Key dates

Filing date: Dec 28, 1998
Grant date: Mar 26, 2002
Priority date
Expiry date: Dec 28, 2018

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99936
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and an apparatus for content identification and categorization of textual data is disclosed. Using the Burrows-Wheeler transform in conjunction with mapping techniques and statistical comparison, useful information can be extracted from textual documents. This information can be used to categorize, authenticate, and compare such documents, thereby leading to automated searching of databases of documents.
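
For readers unfamiliar with the Burrows-Wheeler transform named in the abstract, the following is a minimal Python sketch of the transform and its inverse. It illustrates only the general, publicly known transform, not the patented method: the mapping and statistical comparison steps are not described in this record and are not reproduced here. The function names `bwt` and `inverse_bwt`, the `sentinel` parameter, and the naive sorted-rotations approach are assumptions chosen for illustration.

```python
def bwt(text: str, sentinel: str = "\x00") -> str:
    """Naive Burrows-Wheeler transform using sorted rotations.

    The sentinel (an assumption of this sketch) marks the end of the
    input so that the transform can be inverted unambiguously.
    """
    if sentinel in text:
        raise ValueError("sentinel must not occur in the input")
    s = text + sentinel
    # Build every cyclic rotation of s and sort them lexicographically.
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    # The transform is the last column of the sorted rotation table.
    return "".join(rotation[-1] for rotation in rotations)


def inverse_bwt(transformed: str, sentinel: str = "\x00") -> str:
    """Invert the transform by rebuilding the rotation table column by column."""
    table = [""] * len(transformed)
    for _ in range(len(transformed)):
        # Prepend the transformed column to each row, then re-sort.
        table = sorted(ch + row for ch, row in zip(transformed, table))
    # The original string is the row that ends with the sentinel.
    original = next(row for row in table if row.endswith(sentinel))
    return original[:-1]
```

With a visible sentinel, `bwt("banana", "$")` returns `annb$aa`, and `inverse_bwt("annb$aa", "$")` recovers `banana`. The transform tends to group identical characters into runs, which is the property that makes its output convenient for later statistical processing. This naive version sorts full rotations and is suitable only for short inputs; practical implementations typically use suffix arrays.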

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.