Patent · US · Expired

System and method for interpreting document contents

US6772170B2 · kind B2 · utility

Cited by: 60
References: 24
Claims: 15
Family size: 0

Assignee

Inventors

Key dates

Filing date: Nov 16, 2002
Grant date: Aug 3, 2004
Priority date
Expiry date: Nov 16, 2022

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99943
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A sequence of word filters is used to eliminate terms in the database that do not discriminate document content, resulting in a filtered word set and a topic word set whose members are highly predictive of content. These two word sets are then formed into a two-dimensional matrix, with each entry calculated as the conditional probability that a document will contain the word in a row given that it contains the word in a column. The matrix representation allows the resultant vectors to be utilized to interpret document contents.
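
The matrix described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name, the whitespace tokenization, the toy documents, and the assignment of the filtered word set to rows and the topic word set to columns are all assumptions made for illustration. The word-filtering stage is not modeled; both word sets are simply hand-picked here.

```python
def conditional_probability_matrix(documents, row_words, column_words):
    """Return matrix[r][c] = P(document contains r | document contains c).

    Hypothetical helper: probabilities are estimated as simple document
    counts over the given collection.
    """
    # Represent each document as the set of words it contains.
    doc_sets = [set(doc.lower().split()) for doc in documents]
    matrix = {}
    for r in row_words:
        matrix[r] = {}
        for c in column_words:
            # Documents that contain the column word (the conditioning event).
            docs_with_c = [d for d in doc_sets if c in d]
            if not docs_with_c:
                # Assumption: an unseen column word yields probability 0.
                matrix[r][c] = 0.0
                continue
            # Of those, how many also contain the row word.
            both = sum(1 for d in docs_with_c if r in d)
            matrix[r][c] = both / len(docs_with_c)
    return matrix


# Toy collection, invented for this example.
documents = [
    "engine torque piston car",
    "engine piston valve",
    "car road traffic",
    "road traffic signal",
]
filtered_words = ["engine", "piston", "car", "road", "traffic"]  # rows (assumed)
topic_words = ["engine", "road"]                                 # columns (assumed)

matrix = conditional_probability_matrix(documents, filtered_words, topic_words)
for word in filtered_words:
    print(word, [matrix[word][t] for t in topic_words])
```

In this sketch each column is a vector over the filtered words, so a topic word such as "road" is characterized by the words that tend to co-occur with it in the same documents.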

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.