Content delineation in document images
US9798924B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 17, 2015 |
| Grant date | Oct 24, 2017 |
| Priority date | — |
| Expiry date | Dec 29, 2035 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/2504
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods and apparatus delineate grouped-together content in documents. Void and unvoid pixels in document images are clustered together. Executing a histogram and an autocorrelation function, including peak detection, against the unvoid clusters reveals the content. Techniques for clustering include iteratively transforming an original image into secondary images, for example with a Haar wavelet transformation. Clustering begins on the lowest image plane and advances to the next higher plane until all void and unvoid pixels in the images are grouped. Void clusters at lower levels remain void clusters at higher levels, so only unvoid clusters of pixels require processing at higher levels, which reduces the processing required. Imaging devices with scanners provide suitable hardware for transforming the document into images, and processors with executable code cluster the pixels together to delineate content. Further processing includes executing OCR or other routines after the void/unvoid analysis.
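The coarse-to-fine clustering and the histogram/autocorrelation step described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation; it rests on several assumptions that the abstract does not state: the "lowest image plane" is taken to be the coarsest (most transformed) image, each Haar step keeps only the approximation band (computed here as a 2x2 block average, which is the Haar low-pass output up to a scale factor), the background value is 0 with non-negative ink values, and the image dimensions are divisible by `2**levels`. All function names (`haar_approx`, `void_mask`, `row_histogram`, `dominant_period`) are hypothetical.

```python
def haar_approx(img):
    """One Haar low-pass step: average each 2x2 block (approximation band, rescaled)."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [[(img[2 * r][2 * c] + img[2 * r][2 * c + 1]
              + img[2 * r + 1][2 * c] + img[2 * r + 1][2 * c + 1]) / 4.0
             for c in range(w)] for r in range(h)]


def build_pyramid(img, levels):
    """Return [original, plane 1, ..., coarsest plane]."""
    planes = [img]
    for _ in range(levels):
        planes.append(haar_approx(planes[-1]))
    return planes


def void_mask(img, levels, background=0, tol=0.0):
    """Coarse-to-fine void map: True marks a void (background) pixel.

    Assumption: a coarse pixel that is void has only void children, so those
    children inherit the label and are never inspected at finer planes.
    """
    planes = build_pyramid(img, levels)
    mask = [[abs(v - background) <= tol for v in row] for row in planes[-1]]
    for plane in reversed(planes[:-1]):
        finer = []
        for r, row in enumerate(plane):
            out = []
            for c, v in enumerate(row):
                if mask[r // 2][c // 2]:
                    out.append(True)  # parent was void: skip the pixel test
                else:
                    out.append(abs(v - background) <= tol)
            finer.append(out)
        mask = finer
    return mask


def row_histogram(mask):
    """Count unvoid pixels per row (a horizontal projection profile)."""
    return [sum(1 for is_void in row if not is_void) for row in mask]


def autocorrelation(signal):
    """Unnormalized autocorrelation of the mean-centered signal, one value per lag."""
    mean = sum(signal) / len(signal)
    centered = [s - mean for s in signal]
    n = len(centered)
    return [sum(centered[i] * centered[i + lag] for i in range(n - lag))
            for lag in range(n)]


def dominant_period(signal):
    """Lag of the strongest positive local peak, or None if there is no peak."""
    ac = autocorrelation(signal)
    peaks = [lag for lag in range(1, len(ac) - 1)
             if ac[lag] > 0 and ac[lag] > ac[lag - 1] and ac[lag] >= ac[lag + 1]]
    return max(peaks, key=lambda lag: ac[lag]) if peaks else None


if __name__ == "__main__":
    # Synthetic page: one "text line" every 4 rows, ink in columns 0..5.
    page = [[1 if (r % 4 == 0 and c < 6) else 0 for c in range(8)]
            for r in range(16)]
    profile = row_histogram(void_mask(page, levels=2))
    print(profile, dominant_period(profile))
```

On the synthetic page, the autocorrelation peak falls at the line spacing, which is the kind of regularity a histogram-plus-autocorrelation pass could use to separate lines or blocks of content. A production system would likely use a wavelet library and tolerance values tuned to scanner noise; those choices are outside what the abstract specifies.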
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.