Method and means for enhancing optical character recognition of printed documents
US5748807A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Oct 15, 1993 |
| Grant date | May 5, 1998 |
| Priority date | — |
| Expiry date | Oct 15, 2013 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A document marker, including first values dependent upon the layout and the contents of the document and assigned by generating or preprocessing software, is provided in machine-readable symbology on the face of a printed version of the document. The marker may include encoded document layout information and values assigned on sequences of the original text, including text-dependent decimation sequences, error correction codes or check-sums. Upon optical character recognition scanning, or other digitizing reproduction, the marker is also scanned. The scanning computer, having corresponding software, assigns second values dependent upon the layout and contents of the reproduced document. Upon comparison of the first and second decimation sequences, line and character errors can be detected and some errors corrected, thereby generating re-aligned candidate sequences. Optional error correction codes can provide further correcting capabilities, as applied to the re-aligned reproduced document sequences, and an optional check-sum comparison can be used to verify that the reproduced sequences are correct.
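The check-sum comparison described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the marker layout (a dictionary holding a line count and one CRC-32 per line), the function names `make_marker` and `find_suspect_lines`, and the choice of CRC-32 are all assumptions made for illustration. The sketch omits decimation sequences, error correction codes, and the machine-readable symbology itself.

```python
import zlib


def make_marker(lines):
    """Compute 'first values' from the original text.

    Hypothetical marker layout: a line count (layout information)
    plus one CRC-32 check-sum per line (content information).
    """
    return {
        "line_count": len(lines),
        "checksums": [zlib.crc32(line.encode("utf-8")) for line in lines],
    }


def find_suspect_lines(marker, scanned_lines):
    """Compute 'second values' from the OCR output and compare them.

    Returns the indices of lines whose check-sum disagrees with the marker.
    Raises ValueError if the number of lines differs, since a per-line
    comparison is meaningless until the lines have been re-aligned.
    """
    if len(scanned_lines) != marker["line_count"]:
        raise ValueError("line count differs from marker; re-alignment needed")
    second = [zlib.crc32(line.encode("utf-8")) for line in scanned_lines]
    return [
        i
        for i, (first_value, second_value) in enumerate(zip(marker["checksums"], second))
        if first_value != second_value
    ]


original = ["The quick brown fox", "jumps over the lazy dog"]
scanned = ["The quick brown fox", "jumps over the 1azy dog"]  # OCR read 'l' as '1'

marker = make_marker(original)
print(find_suspect_lines(marker, scanned))  # [1]
```

A check-sum of this kind only flags that a line is wrong. Locating and correcting the faulty character is the role the abstract assigns to the decimation sequences and the optional error correction codes.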
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.