Patent · US Active

Correcting segmentation errors in OCR

US7406201B2 · kind B2 · utility

Cited by: 8
References: 2
Claims: 4
Family size: 0

Assignee

Inventors

Key dates

Filing date: Dec 4, 2003
Grant date: Jul 29, 2008
Priority date
Expiry date: Aug 6, 2026

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for encoding characters includes identifying one or more sequences of character codes that are likely to be generated due to a segmentation error in application of a pattern recognition process, and associating a respective extension character code with each of the sequences. An area of an image containing characters is divided into segments, such that each segment contains approximately one character. The pattern recognition process is applied to each of the segments in order to generate an input string of character codes. At least one of the identified sequences of character codes in the input string is replaced with the respective extension character code so as to generate a modified string. An output string is determined by comparing the modified string to a directory of known strings.
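
The replacement and directory-comparison steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The table `EXTENSIONS`, the helper names, the Private Use Area code points used as extension codes, and the matching rule are all assumptions made for illustration: each extension code is taken to match either the original sequence or the single character it may have been split from. Image segmentation and per-segment recognition are omitted, so the sketch starts from an input string that has already been recognized.

```python
import re

# Hypothetical table (not from the patent): each sequence that a segmentation
# error tends to produce maps to (extension code, the single character the
# sequence may really be). The extension codes are arbitrary Private Use Area
# code points chosen for this sketch.
EXTENSIONS = {
    "rn": ("\ue000", "m"),  # 'm' split into 'r' + 'n'
    "cl": ("\ue001", "d"),  # 'd' split into 'c' + 'l'
    "vv": ("\ue002", "w"),  # 'w' split into 'v' + 'v'
}


def to_modified_string(input_string):
    """Replace each listed sequence in the input string with its extension code."""
    modified = input_string
    for sequence, (code, _merged) in EXTENSIONS.items():
        modified = modified.replace(sequence, code)
    return modified


def lookup(modified, directory):
    """Return the directory entries that are consistent with the modified string.

    Assumed matching rule: an extension code matches either the original
    sequence or the single character it may stand for; every other
    character must match literally.
    """
    alternatives = {
        code: "(?:" + re.escape(sequence) + "|" + re.escape(merged) + ")"
        for sequence, (code, merged) in EXTENSIONS.items()
    }
    pattern = "".join(alternatives.get(ch, re.escape(ch)) for ch in modified)
    regex = re.compile(pattern)
    return [entry for entry in directory if regex.fullmatch(entry)]


def correct(input_string, directory):
    """Encode the input string, then compare it to the directory of known strings."""
    return lookup(to_modified_string(input_string), directory)
```

For example, `correct("rnodern", ["modern", "modem", "model"])` keeps both "modern" and "modem", because each "rn" in the input may be a genuine "rn" or a split "m". A fuller system would need some way to rank such ties, which this sketch leaves open.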

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.