Patent · US Expired

Segmentation of text, picture and lines of a document image

US5335290A · kind A · utility

Cited by: 149
References: 6
Claims: 10
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 6, 1992
Grant date: Aug 2, 1994
Priority date
Expiry date: Apr 6, 2012

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06T9/005
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

In a character recognition system, a method and apparatus for segmenting a document image into areas containing text and non-text. Document segmentation in the present invention generally comprises the steps of: providing a bit-mapped representation of the document image; extracting run lengths for each scanline from the bit-mapped representation of the document image; constructing rectangles from the run lengths; initially classifying each of the rectangles as either text or non-text; correcting for the skew in the rectangles; merging associated text into one or more text blocks; and logically ordering the text blocks.
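
The first steps named in the abstract (extracting run lengths per scanline and constructing rectangles from them) can be illustrated with a minimal Python sketch. The abstract does not say how runs are grouped into rectangles, so the grouping rule here (attach a run to the first rectangle that reached the previous scanline and overlaps it horizontally) is an assumption chosen for brevity, not the patented method. The names `extract_runs` and `build_rectangles` are hypothetical, and the bitmap is assumed to be a list of rows of 0/1 pixels with 1 meaning black.

```python
from typing import List, Tuple

Run = Tuple[int, int]             # (start_x, end_x), inclusive, within one scanline
Rect = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive


def extract_runs(scanline: List[int]) -> List[Run]:
    """Return the runs of black (1) pixels found in a single scanline."""
    runs: List[Run] = []
    start = None
    for x, pixel in enumerate(scanline):
        if pixel and start is None:
            start = x                      # a new run begins
        elif not pixel and start is not None:
            runs.append((start, x - 1))    # the run ended on the previous pixel
            start = None
    if start is not None:                  # run extends to the end of the line
        runs.append((start, len(scanline) - 1))
    return runs


def build_rectangles(bitmap: List[List[int]]) -> List[Rect]:
    """Group runs into bounding rectangles (simplified, assumed rule).

    A run is attached to the first rectangle that reached the previous
    (or current) scanline and overlaps the run horizontally; otherwise
    the run starts a new rectangle.
    """
    rects: List[List[int]] = []            # each is [left, top, right, bottom]
    for y, scanline in enumerate(bitmap):
        for start, end in extract_runs(scanline):
            for rect in rects:
                touches = rect[3] >= y - 1
                overlaps = start <= rect[2] and end >= rect[0]
                if touches and overlaps:
                    rect[0] = min(rect[0], start)
                    rect[2] = max(rect[2], end)
                    rect[3] = y
                    break
            else:
                rects.append([start, y, end, y])
    return [(r[0], r[1], r[2], r[3]) for r in rects]
```

Calling `build_rectangles` on a small bitmap returns one `(left, top, right, bottom)` tuple per group of vertically adjacent, overlapping runs. The later steps in the abstract (text/non-text classification, skew correction, merging, ordering) would operate on these rectangles and are not sketched here because the abstract gives no criteria for them.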

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.