Patent · US Active

Lower modifier detection and extraction from Devanagari text images to improve OCR performance

US9064191B2 · kind B2 · utility

Cited by: 0
References: 46
Claims: 22
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 8, 2013
Grant date: Jun 23, 2015
Priority date
Expiry date: Sep 13, 2033

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/293
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Systems, apparatus and methods are presented for extracting lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test. The method obtains the word image and performs the plurality of tests. The first test determines whether a vertical line spanning the height of the word image exists. The second test determines whether a jump in the number of components exists in the lower portion of the word image. The third test determines sparseness in a lower portion of the word image. The plurality of tests may run sequentially and/or in parallel. Whether a lower modifier exists is decided by comparing and accumulating the results of the plurality of tests.
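
The three tests and the accumulation step in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract gives no thresholds, no definition of "lower portion", and no rule for how each test counts toward the decision, so everything below is an assumption made for illustration: the function names, the 0/1 list-of-rows image format, the parameters `min_frac`, `lower_frac`, `min_jump` and `max_density`, the use of per-row ink runs as a stand-in for "components", and the simple signed vote (a full-height vertical line is treated as evidence against a lower modifier, while a component jump and a sparse lower portion are treated as evidence for one).

```python
from typing import List

Image = List[List[int]]  # rows of pixels, 1 = ink, 0 = background


def has_full_height_line(img: Image, min_frac: float = 0.95) -> bool:
    """Test 1 (assumed form): some column is ink over nearly the full height."""
    height = len(img)
    for x in range(len(img[0])):
        ink = sum(img[y][x] for y in range(height))
        if ink >= min_frac * height:
            return True
    return False


def count_runs(row: List[int]) -> int:
    """Count horizontal ink runs in one row (a crude proxy for components)."""
    runs, prev = 0, 0
    for px in row:
        if px and not prev:
            runs += 1
        prev = px
    return runs


def lower_rows(img: Image, lower_frac: float = 0.3) -> Image:
    """Return the bottom `lower_frac` of the rows (hypothetical definition)."""
    n = max(1, round(len(img) * lower_frac))
    return img[len(img) - n:]


def has_component_jump(img: Image, min_jump: int = 2) -> bool:
    """Test 2 (assumed form): run count changes sharply between adjacent rows."""
    counts = [count_runs(r) for r in lower_rows(img)]
    return any(abs(a - b) >= min_jump for a, b in zip(counts, counts[1:]))


def is_lower_sparse(img: Image, max_density: float = 0.2) -> bool:
    """Test 3 (assumed form): ink density of the lower portion is low."""
    rows = lower_rows(img)
    total = sum(len(r) for r in rows)
    ink = sum(sum(r) for r in rows)
    return ink / total <= max_density


def has_lower_modifier(img: Image) -> bool:
    """Accumulate the three test results into one decision (assumed voting)."""
    score = 0
    score += -1 if has_full_height_line(img) else 1
    score += 1 if has_component_jump(img) else -1
    score += 1 if is_lower_sparse(img) else -1
    return score > 0


# Synthetic 10 x 12 word images, invented for this example.
W = 12
headline = [1] * W


def row_with(cols) -> List[int]:
    return [1 if x in cols else 0 for x in range(W)]


stroke = row_with({2, 8})  # two vertical strokes
no_modifier = [headline] + [stroke] * 9
with_modifier = [headline] + [stroke] * 6 + [
    row_with({5}), row_with({3, 5, 7}), row_with({4, 5, 6}),
]

print(has_lower_modifier(no_modifier))    # False
print(has_lower_modifier(with_modifier))  # True
```

The tests are independent functions of the same image, so they could equally be run in parallel, as the abstract allows. A real system would more likely use true connected-component labelling and thresholds tuned on data.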

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.