Method and apparatus for performing optical character recognition (OCR) and text stitching
US7343049B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Mar 7, 2002 |
| Grant date | Mar 11, 2008 |
| Priority date | — |
| Expiry date | Mar 30, 2024 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V10/16
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method of generating an electronic text file from a paper-based document that includes a plurality of characters includes capturing a plurality of partially overlapping digital images of the document. Optical character recognition is performed on each one of the plurality of captured digital images, thereby generating a corresponding plurality of electronic text files. Each one of the electronic text files includes a portion of the plurality of characters in the document. The plurality of electronic text files are compared with one another to identify characters that are in common between the electronic text files. The plurality of electronic text files are combined into a combined text file based on the comparison. The combined text file includes the plurality of characters in the document.
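The compare-and-combine step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method. It assumes the per-image OCR results arrive in reading order and that the overlapping text is recognized identically in both neighbors, which real OCR output would not guarantee. The names `overlap_length`, `stitch`, and `min_overlap` are hypothetical and chosen for illustration only.

```python
from typing import Sequence


def overlap_length(left: str, right: str, min_overlap: int = 3) -> int:
    """Return the length of the longest suffix of `left` that is a prefix of `right`.

    Overlaps shorter than `min_overlap` characters are ignored (assumed threshold)
    so that a coincidental one- or two-character match does not merge fragments.
    """
    for size in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return size
    return 0


def stitch(fragments: Sequence[str], min_overlap: int = 3) -> str:
    """Combine ordered text fragments, keeping shared characters only once."""
    if not fragments:
        return ""
    combined = fragments[0]
    for fragment in fragments[1:]:
        size = overlap_length(combined, fragment, min_overlap)
        # Append only the part of the fragment not already present.
        combined += fragment[size:]
    return combined


# Hypothetical OCR output from three partially overlapping images.
ocr_results = [
    "The quick brown fox",
    "brown fox jumps over",
    "jumps over the lazy dog",
]
combined_text = stitch(ocr_results)
print(combined_text)
```

A more tolerant variant could replace the exact suffix/prefix test with approximate matching (for example, via `difflib.SequenceMatcher`) so that isolated recognition errors in the overlap do not prevent stitching.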
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.