Patent · US Active

Low resolution OCR for camera acquired documents

US7499588B2 · kind B2 · utility

65Cited by

14References

20Claims

0Family size

Assignee

Microsoft Corporation · US

Inventors

Charles E. Jacobs · Seattle, US
James Rinker · Kirkland, US
Patrice Y. Simard · Bellevue, US
Paul Viola · Seattle, US

Key dates

Filing date	May 20, 2004
Grant date	Mar 3, 2009
Priority date	—
Expiry date	Sep 16, 2026

Classification

Technology area (CPC G)Physics
CPC primaryG06V30/10
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A global optimization framework for optical character recognition (OCR) of low-resolution photographed documents that combines a binarization-type process, segmentation, and recognition into a single process. The framework includes a machine learning approach trained on a large amount of data. A convolutional neural network can be employed to compute a classification function at multiple positions and take grey-level input which eliminates binarization. The framework utilizes preprocessing, layout analysis, character recognition, and word recognition to output high recognition rates. The framework also employs dynamic programming and language models to arrive at the desired output.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.