Patent · US Active

Low resolution OCR for camera acquired documents

US7499588B2 · kind B2 · utility

65Cited by
14References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 20, 2004
Grant dateMar 3, 2009
Priority date
Expiry dateSep 16, 2026

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A global optimization framework for optical character recognition (OCR) of low-resolution photographed documents that combines a binarization-type process, segmentation, and recognition into a single process. The framework includes a machine learning approach trained on a large amount of data. A convolutional neural network can be employed to compute a classification function at multiple positions and take grey-level input which eliminates binarization. The framework utilizes preprocessing, layout analysis, character recognition, and word recognition to output high recognition rates. The framework also employs dynamic programming and language models to arrive at the desired output.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.