Segmentation of text, picture and lines of a document image
US5335290A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Apr 6, 1992 |
| Grant date | Aug 2, 1994 |
| Priority date | — |
| Expiry date | Apr 6, 2012 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T9/005
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
In a character recognition system, a method and apparatus for segmenting a document image into areas containing text and non-text. Document segmentation in the present invention is comprised generally of the steps of: providing a bit-mapped representation of the document image, extracting run lengths for each scanline from the bit-mapped representation of the document image; constructing rectangles from the run lengths; initially classifying each of the rectangles as either text or non-text; correcting for the skew in the rectangles; merging associated text into one or more text blocks; and logically ordering the text blocks.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.