Optical character recognition system using multiple images and method of use
US9465774B2 · kind B2 · utility
Inventor
Key dates
| Filing date | Mar 17, 2015 |
| Grant date | Oct 11, 2016 |
| Priority date | — |
| Expiry date | Mar 17, 2035 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed is an improved OCR system that captures and analyzes multiple images of a document to increase the efficiency and accuracy of digitizing the document's printed text. The captured images are merged into a single set of character recognition results by a multi-image recognition method, which has early fusion, late fusion, and hybrid fusion embodiments. Each embodiment outputs text and metadata that include the recognized words. In late and hybrid fusion, words with confidence scores above a predetermined threshold are assembled into paragraphs to reconstruct a digital version of the document. In this way, the invention combines multiple images to aggregate information and achieve high accuracy when scanning and digitizing printed text.
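The late fusion step described in the abstract (keep words whose confidence exceeds a threshold, then assemble them into paragraphs) can be illustrated with a minimal Python sketch. This is not the patented method: the abstract does not say how words from different images are aligned or how competing readings are resolved. The `Word` fields `paragraph` and `position`, the rule of keeping the highest-confidence reading per slot, and the default threshold of 0.8 are all assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    text: str
    confidence: float  # recognizer confidence, assumed to lie in [0, 1]
    paragraph: int     # hypothetical paragraph index on the page
    position: int      # hypothetical word index within that paragraph


def late_fusion(per_image_words, threshold=0.8):
    """Merge per-image OCR results into a list of paragraph strings.

    Assumption: words from different images are already aligned to the
    same (paragraph, position) slot; the abstract does not specify this.
    """
    # For each slot, keep the highest-confidence reading seen in any image.
    best = {}
    for words in per_image_words:
        for w in words:
            slot = (w.paragraph, w.position)
            if slot not in best or w.confidence > best[slot].confidence:
                best[slot] = w

    # Keep only readings above the threshold and group them by paragraph.
    paragraphs = defaultdict(list)
    for slot in sorted(best):
        w = best[slot]
        if w.confidence > threshold:
            paragraphs[w.paragraph].append(w.text)

    return [" ".join(paragraphs[p]) for p in sorted(paragraphs)]


if __name__ == "__main__":
    image_a = [
        Word("Optical", 0.95, 0, 0),
        Word("charactor", 0.55, 0, 1),
        Word("recognition", 0.90, 0, 2),
    ]
    image_b = [
        Word("Optical", 0.70, 0, 0),
        Word("character", 0.92, 0, 1),
        Word("rec0gnition", 0.40, 0, 2),
        Word("works", 0.30, 1, 0),
    ]
    print(late_fusion([image_a, image_b]))
```

In this sketch a misread in one image ("charactor") is replaced by a more confident reading from another image, while a word that never exceeds the threshold in any image ("works") is left out of the reconstructed text.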
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.