Pattern recognition process for text document interpretation
US6792145B2 · kind B2 · utility
Inventor
Key dates
| Filing date | Jun 8, 2001 |
| Grant date | Sep 14, 2004 |
| Priority date | — |
| Expiry date | Sep 4, 2021 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06Q40/12
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present invention is directed to a method for extracting textual as well as tabular data material from financial documents. Initially, a comparison is made to determine the type of data schedule material provided in the document. Subsequently, the character strings of the financial document are compared to character strings provided in previous documents or in various databases. The database of the previous document would include the textual material in a first plane, and the tabular material also in that first plane. If a character string match is made between a new document and an old document, the new tabular data material would be provided in a data matrix in a second plane but the corresponding textual material would not be included in the textual matrix provided in that second plane. Only character strings not matched in the textual material of the first plane would be provided in the textual matrix of the second plane.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.