Patent · US Expired

Algorithm for the segmentation of printed fixed pitch documents

US4377803A · kind A · utility

58Cited by
9References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 2, 1980
Grant dateMar 22, 1983
Priority date
Expiry dateJul 2, 2000

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

An apparatus and method is provided for segmenting characters generated by an optical scanner. The apparatus also identifies underscores. The underscores are then masked and subsequent processing devices are informed of the existence of said underscores. Input video raster scans representative of a portion of a line of textual material are loaded into a video buffer. The video raster scans are broken up into a plurality of sections. The horizontal histogram (number of black pixel counts) associated with each section is determined. The baseline, vertical histogram and word location for each line of data to be segmented is determined. A find character unit finds the boundaries for each character. The character is sequentially transferred from the video buffer to a character output buffer.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.