Shape clustering in post optical character recognition processing
US8111927B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 20, 2010 |
| Grant date | Feb 7, 2012 |
| Priority date | — |
| Expiry date | May 20, 2030 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Systems, methods and computer program products for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process. Clip images defined in a received OCR output are classified into a plurality of clusters of clip images. Clip images in each of the plurality of clusters are processed to generate a cluster image for each cluster. Shape differences between the cluster images of a first cluster and a second cluster and between the cluster images of the first cluster and a third cluster are used to determine a level of confidence in one or more first OCR character codes assigned to the first cluster.
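The pipeline summarized in the abstract (cluster the clip images, build one cluster image per cluster, compare cluster images, derive a confidence) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the greedy threshold clustering, the pixel-wise mean as the cluster image, the mean absolute pixel difference as the shape difference, the similarity-weighted agreement used as the confidence score, and all function names and the tiny 3x3 test glyphs.

```python
from collections import Counter


def shape_difference(a, b):
    """Mean absolute pixel difference between two same-sized images (0.0 to 1.0)."""
    pixels = [(pa, pb) for row_a, row_b in zip(a, b) for pa, pb in zip(row_a, row_b)]
    return sum(abs(pa - pb) for pa, pb in pixels) / len(pixels)


def cluster_clips(clips, threshold=0.2):
    """Greedy clustering (assumed): a clip joins the first cluster whose
    first member differs from it by no more than `threshold`."""
    clusters = []  # each cluster is a list of (image, ocr_code) pairs
    for image, code in clips:
        for members in clusters:
            if shape_difference(image, members[0][0]) <= threshold:
                members.append((image, code))
                break
        else:
            clusters.append([(image, code)])
    return clusters


def cluster_image(members):
    """Pixel-wise mean of all clip images in one cluster."""
    images = [image for image, _ in members]
    rows, cols = len(images[0]), len(images[0][0])
    return [[sum(img[r][c] for img in images) / len(images) for c in range(cols)]
            for r in range(rows)]


def cluster_code(members):
    """Most common OCR character code assigned to the clips in a cluster."""
    return Counter(code for _, code in members).most_common(1)[0][0]


def confidence(first, others):
    """Assumed heuristic: similarity-weighted share of the other clusters
    whose OCR code agrees with the first cluster's code.
    `first` and each entry of `others` are (cluster_image, code) pairs."""
    agree = total = 0.0
    for image, code in others:
        similarity = 1.0 - shape_difference(first[0], image)
        total += similarity
        if code == first[1]:
            agree += similarity
    return agree / total if total else 0.0


# Hypothetical 3x3 binary clip images with the OCR codes assigned to them.
BAR = [[0, 1, 0], [0, 1, 0], [0, 1, 0]]
SERIF = [[1, 1, 0], [0, 1, 0], [1, 1, 1]]
RING = [[1, 1, 1], [1, 0, 1], [1, 1, 1]]

clips = [(BAR, "l"), (BAR, "l"), (SERIF, "1"), (RING, "o")]
clusters = cluster_clips(clips)
summaries = [(cluster_image(m), cluster_code(m)) for m in clusters]
first, second, third = summaries
print(confidence(first, [second, third]))
```

In this sketch the first cluster's closest neighbor in shape carries a different OCR code, so the score for the first cluster's code comes out low; if the nearby cluster shared the code, the score would rise. A real system would work on full-resolution glyph images and would likely use a more robust distance and clustering method.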
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.