Patent · US Active

Shape clustering in post optical character recognition processing

US8111927B2 · kind B2 · utility

Cited by: 47
References: 18
Claims: 5
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 20, 2010
Grant date: Feb 7, 2012
Priority date
Expiry date: May 20, 2030

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Systems, methods and computer program products for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process. Clip images defined in a received OCR output are classified into a plurality of clusters of clip images. Clip images in each of the plurality of clusters are processed to generate a cluster image for each cluster. Shape differences between the cluster images of a first cluster and a second cluster and between the cluster images of the first cluster and a third cluster are used to determine a level of confidence in one or more first OCR character codes assigned to the first cluster.
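
The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how clips are clustered, how a cluster image is formed, or how shape differences map to a confidence level. The sketch assumes clip images are small, equally sized grids of 0/1 pixels, clusters greedily by OCR code and pixel difference, averages clips to get a cluster image, and uses a hypothetical ratio score. All function names, the threshold, and the sample data are illustrative assumptions.

```python
def shape_distance(a, b):
    """Mean absolute pixel difference between two equally sized images."""
    total = sum(abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return total / (len(a) * len(a[0]))


def cluster_clips(ocr_output, threshold):
    """Greedy clustering (an assumption): a clip joins the first cluster that
    has the same OCR code and whose first member is within `threshold`."""
    clusters = []  # each entry: {"code": str, "clips": [image, ...]}
    for code, image in ocr_output:
        for cluster in clusters:
            same_code = cluster["code"] == code
            if same_code and shape_distance(cluster["clips"][0], image) <= threshold:
                cluster["clips"].append(image)
                break
        else:
            clusters.append({"code": code, "clips": [image]})
    return clusters


def cluster_image(clips):
    """Average the clip images of one cluster pixel by pixel."""
    n = len(clips)
    rows, cols = len(clips[0]), len(clips[0][0])
    return [[sum(c[r][k] for c in clips) / n for k in range(cols)]
            for r in range(rows)]


def confidence(first, same_code, other_code):
    """Hypothetical score in [0, 1] for the code assigned to `first`.

    `same_code` is the image of a second cluster sharing the first cluster's
    OCR code; `other_code` is the image of a third cluster with a different
    code. The score is low when `first` looks more like the other code.
    """
    d_same = shape_distance(first, same_code)
    d_other = shape_distance(first, other_code)
    if d_same + d_other == 0:
        return 0.5  # no shape evidence either way
    return d_other / (d_same + d_other)


# Made-up 3x3 clips: a plain bar ("l") and a bar with a flag ("1").
BAR = [[0, 1, 0], [0, 1, 0], [0, 1, 0]]
FLAG = [[1, 1, 0], [0, 1, 0], [0, 1, 0]]

# The last clip has the flag shape but was labeled "l" by the OCR engine.
ocr_output = [("l", BAR), ("l", BAR), ("1", FLAG), ("1", FLAG), ("l", FLAG)]

clusters = cluster_clips(ocr_output, threshold=0.1)
images = [cluster_image(c["clips"]) for c in clusters]

# Suspect cluster (index 2) compared with the "l" bars and the "1" flags.
score = confidence(images[2], images[0], images[1])
print(len(clusters), score)
```

In this toy data the mislabeled clip forms its own "l" cluster whose image matches the "1" cluster exactly, so the assumed score for its "l" code drops to the bottom of the range.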

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.