Algorithm for the segmentation of printed fixed pitch documents
US4377803A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Jul 2, 1980 |
| Grant date | Mar 22, 1983 |
| Priority date | — |
| Expiry date | Jul 2, 2000 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An apparatus and method is provided for segmenting characters generated by an optical scanner. The apparatus also identifies underscores. The underscores are then masked and subsequent processing devices are informed of the existence of said underscores. Input video raster scans representative of a portion of a line of textual material are loaded into a video buffer. The video raster scans are broken up into a plurality of sections. The horizontal histogram (number of black pixel counts) associated with each section is determined. The baseline, vertical histogram and word location for each line of data to be segmented is determined. A find character unit finds the boundaries for each character. The character is sequentially transferred from the video buffer to a character output buffer.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.