Patent · US · Expired

Extracting information from symbolically compressed document images

US6658151B2 · kind B2 · utility

Cited by: 89
References: 19
Claims: 41
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 8, 1999
Grant date: Dec 2, 2003
Priority date
Expiry date: Apr 8, 2019

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99939
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and apparatus for extracting information from symbolically compressed document images. A deciphering module generates first and second text strings by deciphering respective sequences of template identifiers in first and second symbolically compressed document images. A conditional n-gram module receives the first and second text strings from the deciphering module and extracts n-gram terms therefrom based on a predicate condition. A comparison module generates a measure of similarity between the first and second symbolically compressed document images based on the n-gram terms extracted by the conditional n-gram module.
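
The three modules named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything in it beyond the abstract is an assumption made for illustration: the substitution key passed to decipher is hypothetical and taken as already known (the abstract does not say how the key is obtained), the predicate follows_space is one possible example of a "predicate condition", and cosine similarity over n-gram counts stands in for the unspecified "measure of similarity".

```python
from collections import Counter
from math import sqrt


def decipher(template_ids, key):
    """Map a sequence of template identifiers to a text string.

    `key` is a hypothetical, already-known substitution table; a real
    system would have to infer it from the compressed image itself.
    """
    return "".join(key.get(t, "?") for t in template_ids)


def follows_space(text, i):
    """Example predicate (an assumption): keep a character only if it
    starts a word, i.e. it is not a space and follows a space or the
    start of the string."""
    return text[i] != " " and (i == 0 or text[i - 1] == " ")


def conditional_ngrams(text, n, predicate):
    """Count n-grams built only from characters that satisfy `predicate`."""
    kept = [ch for i, ch in enumerate(text) if predicate(text, i)]
    return Counter("".join(kept[i:i + n]) for i in range(len(kept) - n + 1))


def cosine_similarity(a, b):
    """Cosine of the angle between two n-gram count vectors
    (an illustrative choice of similarity measure)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def document_similarity(ids_a, ids_b, key, n=2, predicate=follows_space):
    """Chain the three steps: decipher, extract conditional n-grams, compare."""
    grams_a = conditional_ngrams(decipher(ids_a, key), n, predicate)
    grams_b = conditional_ngrams(decipher(ids_b, key), n, predicate)
    return cosine_similarity(grams_a, grams_b)
```

Under these assumptions, two documents whose word-initial letters occur in the same order score close to 1, and documents that share no conditional n-grams score 0. Any other predicate or similarity measure can be swapped in without changing the overall structure.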

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.