Method of text information recognition from a graphical file with use of dictionaries and other supplementary data
US7734065B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 6, 2006 |
| Grant date | Jun 8, 2010 |
| Priority date | — |
| Expiry date | Apr 4, 2029 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/40
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present invention deals with text comprising image parsed to graphemes. A result of character recognition is creation of one or more versions of characters for each grapheme. All possible words versions are obtained using all characters versions, and all parsing versions are examined. A supplementary data of several types is applied successively in the preliminarily prescribed order to the examined words. The processing with the use of supplemental data may be represented as a three times repeated processing of the same text fragment with the use of supplementary information becoming available at each time. The examination comprises three steps. 1) A set of chains LPG is built using all obtained recognized grapheme-to-character versions. 2) All obtained versions are analyzed with the successive application of subsequent supplemental data types in connection with the preliminarily assigned order or with a joint application thereof. 3) A supplementary space recognition correction.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.