Text detection using multi-layer connected components with histograms
US8611662B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Nov 21, 2011 |
| Grant date | Dec 17, 2013 |
| Priority date | — |
| Expiry date | Nov 21, 2031 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A digital image is converted to a multiple level image, and multiple scale sets are formed from connected components of the multiple level image such that different ones of the scale sets define different size spatial bins. For each of the multiple scale sets there is generated a count of connected components extracted from the respective scale set for each spatial bin; and adjacent spatial bins which represent connected components are linked. Then the connected components from the different scale sets are merged and text line detection is performed on the merged connected components. In one embodiment each of the scale sets is a histogram, and prior to linking all bins with less than a predetermined count are filtered out; and each histogram is extended such that counts of adjacent horizontal and vertical bins are added (single region bins are filtered out) and the linking is on the extended histograms.
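The histogram embodiment described in the abstract can be illustrated with a minimal Python sketch for a single scale set. It is not the patented implementation. The function names, the use of component center points, the square bin size, the 4-neighbour definition of "adjacent", and the reading of "single region bins" as bins with no populated neighbour are all assumptions made for illustration; the abstract does not specify any of them.

```python
from collections import Counter


def _neighbours(r, c):
    # Horizontally and vertically adjacent bins (assumed 4-connectivity).
    return ((r, c - 1), (r, c + 1), (r - 1, c), (r + 1, c))


def bin_counts(centers, bin_size):
    """Count connected-component centers (x, y) per (row, col) spatial bin."""
    return Counter((int(y // bin_size), int(x // bin_size)) for x, y in centers)


def filter_bins(hist, min_count):
    """Drop bins whose count is below the predetermined threshold."""
    return {b: n for b, n in hist.items() if n >= min_count}


def extend_histogram(hist):
    """Add the counts of adjacent bins to each bin.

    Assumption: a bin with no populated neighbour is treated as a
    "single region" bin and is filtered out.
    """
    extended = {}
    for (r, c), n in hist.items():
        neighbour_total = sum(hist.get(nb, 0) for nb in _neighbours(r, c))
        if neighbour_total > 0:
            extended[(r, c)] = n + neighbour_total
    return extended


def link_bins(bins):
    """Group adjacent bins into linked regions with a flood fill."""
    remaining = set(bins)
    groups = []
    while remaining:
        start = remaining.pop()
        group, stack = {start}, [start]
        while stack:
            r, c = stack.pop()
            for nb in _neighbours(r, c):
                if nb in remaining:
                    remaining.remove(nb)
                    group.add(nb)
                    stack.append(nb)
        groups.append(group)
    return groups


# Hypothetical component centers: four clustered on one line, one stray.
centers = [(1, 1), (2, 2), (12, 3), (13, 4), (95, 95)]
hist = filter_bins(bin_counts(centers, bin_size=10), min_count=2)
linked = link_bins(extend_histogram(hist))
```

In the full method, this would be repeated for each scale set (each with its own bin size), after which the connected components from the linked bins of all scale sets are merged and passed to text line detection.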
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.