Patent · US Active

Optical character recognition system using multiple images and method of use

US9465774B2 · kind B2 · utility

5Cited by

1References

14Claims

0Family size

Inventor

Benoit Maison · White Plains, US

Key dates

Filing date	Mar 17, 2015
Grant date	Oct 11, 2016
Priority date	—
Expiry date	Mar 17, 2035

Classification

Technology area (CPC G)Physics
CPC primaryG06V30/10
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed is an improved OCR system wherein the same can be utilized for capturing and analyzing multiple images of a document to increase the efficiency and accuracy of digitizing printed texts on the document. Captured images are merged into a single set of character recognition results via a recognition method from multiple images, which include early fusion, late fusion, and hybrid fusion embodiments. The end product from each of the embodiments provides text and metadata that include recognized words. In late and hybrid fusion, words having confidence scores above a predetermined threshold are assembled together to form paragraphs to reconstruct a digital version of the document. In this way, the present invention utilizes multiple images that can be combined to aggregate information and achieve high accuracy when scanning and digitizing printed texts.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.