System and method for interpreting document contents
US6772170B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 16, 2002 |
| Grant date | Aug 3, 2004 |
| Priority date | — |
| Expiry date | Nov 16, 2022 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99943
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A sequence of word filters are used to eliminate terms in the database which do not discriminate document content, resulting in a filtered word set and a topic word set whose members are highly predictive of content. These two word sets are then formed into a two dimensional matrix with matrix entries calculated as the conditional probability that a document will contain a word in a row given that it contains the word in a column. The matrix representation allows the resultant vectors to be utilized to interpret document contents.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.