Patent · US Expired

OCR-based image compression

US6487311B1 · kind B1 · utility

Cited by: 8
References: 6
Claims: 15
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 4, 1999
Grant date: Nov 26, 2002
Priority date
Expiry date: May 4, 2019

Classification

  • Technology area (CPC H): Electricity
  • CPC primary: H04N1/4115
  • WIPO field: Audio-visual technology
  • WIPO sector: Electrical engineering

Abstract

A method for compressing a digitized image of a document using optical character recognition (OCR). The method includes performing OCR on the digitized image; identifying, based at least in part on a result of the performing step, a plurality of classes of characters comprised in the image, each class of characters having an associated character value and comprising at least one character; pruning each class of characters, thereby producing information describing the plurality of classes of characters and a residual image; and utilizing the information describing the plurality of classes of characters and the residual image as a compressed digitized image in further processing. Related methods and apparatus are also disclosed.
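The steps named in the abstract can be illustrated with a toy sketch. This is not the patented method: the abstract does not define how classes are pruned or how the residual image is formed, so the sketch assumes a simple template substitution with an XOR residual. The function names (`compress`, `decompress`, `crop`), the bounding-box layout `(top, left, height, width)`, and the majority-vote template are all hypothetical choices made for illustration, and the OCR step itself is assumed to have already produced a list of `(character value, bounding box)` pairs.

```python
from collections import defaultdict


def crop(image, box):
    """Return the sub-bitmap of a binary image (list of rows) inside box."""
    top, left, height, width = box
    return [row[left:left + width] for row in image[top:top + height]]


def compress(image, ocr_results):
    """Toy compression of a binary image given hypothetical OCR output.

    ocr_results: list of (char_value, (top, left, height, width)).
    Returns (templates, placements, residual).
    """
    # Assumption: a class is every glyph with the same character value
    # and the same bounding-box size.
    classes = defaultdict(list)
    for value, box in ocr_results:
        classes[(value, box[2], box[3])].append(box)

    residual = [row[:] for row in image]
    templates, placements = {}, []
    for key, boxes in classes.items():
        _, height, width = key
        glyphs = [crop(image, box) for box in boxes]
        # Assumption: one representative template per class, built by a
        # per-pixel majority vote over the class members (ties count as 1).
        template = [
            [int(2 * sum(g[r][c] for g in glyphs) >= len(glyphs))
             for c in range(width)]
            for r in range(height)
        ]
        templates[key] = template
        for top, left, _, _ in boxes:
            placements.append((key, top, left))
            # XOR the template out of the image; only the pixels where the
            # glyph differs from its template remain in the residual.
            for r in range(height):
                for c in range(width):
                    residual[top + r][left + c] ^= template[r][c]
    return templates, placements, residual


def decompress(templates, placements, residual):
    """Rebuild the image by XOR-ing each template back at its position."""
    image = [row[:] for row in residual]
    for key, top, left in placements:
        for r, row in enumerate(templates[key]):
            for c, bit in enumerate(row):
                image[top + r][left + c] ^= bit
    return image


if __name__ == "__main__":
    page = [
        [0, 0, 0, 0, 0, 0, 0, 0],
        [0, 1, 1, 0, 0, 1, 1, 0],
        [0, 1, 1, 0, 0, 1, 1, 0],
        [0, 1, 1, 0, 0, 1, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 1],
    ]
    ocr = [("I", (1, 1, 3, 2)), ("I", (1, 5, 3, 2))]
    templates, placements, residual = compress(page, ocr)
    assert decompress(templates, placements, residual) == page
```

In this sketch the class information (templates plus placements) and the residual together play the role of the compressed image. Because XOR is its own inverse, the round trip is lossless in the toy; whether the patented method is lossless or lossy is not stated in the abstract.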

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.