Patent · US Active

Automated segmentation tuner

US9070011B2 · kind B2 · utility

1Cited by
13References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 17, 2011
Grant dateJun 30, 2015
Priority date
Expiry dateFeb 29, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V20/68
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

A system, method, and computer program product are provided for automatically segmenting input document images into regions of black text, white space, and image content. A set of scanned training documents representing the range of text and images to be processed is coarsely tagged to classify regions by content type. The training images are divided into bricks, parameters describing individual brick features are evaluated, and the bricks are classified according to the parameter values. A classification map that relates parameter values to classification codes describing content type is constructed by generating linear equations separating a parameter space into parameter regions along classification boundaries. After training, input documents are scanned and divided into bricks, and brick parameters are converted into an index into the classification map, to classify document regions by content.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.