Patent · US Active

Optical character recognition system using multiple images and method of use

US9465774B2 · kind B2 · utility

5Cited by
1References
14Claims
0Family size

Inventor

Key dates

Filing dateMar 17, 2015
Grant dateOct 11, 2016
Priority date
Expiry dateMar 17, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed is an improved OCR system wherein the same can be utilized for capturing and analyzing multiple images of a document to increase the efficiency and accuracy of digitizing printed texts on the document. Captured images are merged into a single set of character recognition results via a recognition method from multiple images, which include early fusion, late fusion, and hybrid fusion embodiments. The end product from each of the embodiments provides text and metadata that include recognized words. In late and hybrid fusion, words having confidence scores above a predetermined threshold are assembled together to form paragraphs to reconstruct a digital version of the document. In this way, the present invention utilizes multiple images that can be combined to aggregate information and achieve high accuracy when scanning and digitizing printed texts.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.