OCR-based image compression
US6487311B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 4, 1999 |
| Grant date | Nov 26, 2002 |
| Priority date | — |
| Expiry date | May 4, 2019 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H04N1/4115
- WIPO field: Audio-visual technology
- WIPO sector: Electrical engineering
Abstract
A method for compressing a digitized image of a document using optical character recognition (OCR). The method includes performing OCR on the digitized image; identifying, based at least in part on a result of the performing step, a plurality of classes of characters comprised in the image, each class of characters having an associated character value and comprising at least one character; pruning each class of characters, thereby producing information describing the plurality of classes of characters and a residual image; and utilizing the information describing the plurality of classes of characters and the residual image as a compressed digitized image in further processing. Related methods and apparatus are also disclosed.
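The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not define pruning or the residual encoding, so everything here is an assumption made for illustration. The OCR step is assumed to have already run and to supply hypothetical `(value, x, y, w, h)` tuples; pruning is interpreted as dropping class members whose bitmap differs from the class template by more than `max_diff` pixels; and the residual is taken to be the image with each retained template XORed out. The names `compress`, `decompress`, and `max_diff` are invented for this example.

```python
from collections import defaultdict


def crop(image, x, y, w, h):
    """Return the w-by-h sub-bitmap whose top-left corner is (x, y)."""
    return [row[x:x + w] for row in image[y:y + h]]


def pixel_diff(a, b):
    """Count differing pixels, or return None if the shapes differ."""
    if [len(r) for r in a] != [len(r) for r in b]:
        return None
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def xor_template(target, template, x, y):
    """XOR a template bitmap into target at (x, y), in place."""
    for dy, row in enumerate(template):
        for dx, pixel in enumerate(row):
            target[y + dy][x + dx] ^= pixel


def compress(image, ocr_results, max_diff=1):
    """Group OCR hits into classes, prune each class, build a residual.

    image: list of rows of 0/1 pixels.
    ocr_results: hypothetical OCR output as (value, x, y, w, h) tuples.
    """
    groups = defaultdict(list)
    for value, x, y, w, h in ocr_results:
        groups[value].append((x, y, w, h))

    residual = [row[:] for row in image]
    classes = {}
    for value, boxes in groups.items():
        # Assumption: the first occurrence serves as the class template.
        template = crop(image, *boxes[0])
        kept = []
        for x, y, w, h in boxes:
            diff = pixel_diff(template, crop(image, x, y, w, h))
            # Pruning (as interpreted here): poorly matching members are
            # dropped from the class and stay in the residual untouched.
            if diff is not None and diff <= max_diff:
                kept.append((x, y))
                xor_template(residual, template, x, y)
        classes[value] = {"template": template, "positions": kept}
    return classes, residual


def decompress(classes, residual):
    """Rebuild the image by XORing every template back into the residual."""
    image = [row[:] for row in residual]
    for cls in classes.values():
        for x, y in cls["positions"]:
            xor_template(image, cls["template"], x, y)
    return image
```

Under these assumptions the compressed form is the pair `(classes, residual)`. Because XOR is its own inverse, `decompress(*compress(image, ocr_results))` returns the original bitmap, and a page with many well-matched characters leaves a mostly empty residual that a general-purpose coder can then compress well.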
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.