Patent · US Active

Content delineation in document images

US9798924B2 · kind B2 · utility

0Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 17, 2015
Grant dateOct 24, 2017
Priority date
Expiry dateDec 29, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/2504
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and apparatus delineate grouped together content in documents. Void and unvoid pixels in document images get clustered together. Execution of a histogram and autocorrelation function, including peak detection, against the unvoid clusters reveals the content. Techniques for clustering include iteratively transforming an original image into secondary images with a Haar wavelet transformation, for example. Clustering begins on a lowest image plane and advances to a next highest plane until all void and unvoid pixels in the images are grouped. Void clusters at lower levels remain void clusters at higher levels, thus only unvoid clusters of pixels require processing at higher levels thereby optimizing processing. Imaging devices with scanners define suitable hardware for transformation of the document into images and processors with executable code cluster together pixels to delineate content. Further processing includes executing OCR or other routines post void/unvoid analysis.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.