Patent · US Active

Methods and systems of text extraction from images

US9412052B1 · kind B1 · utility

Cited by: 11
References: 1
Claims: 22
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 23, 2015
Grant date: Aug 9, 2016
Priority date
Expiry date: Jun 23, 2035

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for extracting text from an image data is disclosed. The method includes pre-processing, via a processor, the image data to obtain a readable image data. The method further includes filtering, via the processor, a plurality of copies of the readable image data using a plurality of noise filters to obtain a corresponding plurality of noise removed images. Yet further, the method includes performing, via the processor, image data recognition on each of the plurality of noise removed images to obtain a text copy associated with each of the plurality of noise removed images. Moreover, the method includes ranking, via the processor, each word in the text copy associated with each of the plurality of noise removed images based on a predefined set of parameters. Finally, the method includes selecting, via the processor, highest ranked words within the text copy associated with each of the plurality of noise removed images to obtain output text for the image data.
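
The flow described in the abstract (filter several copies, recognize each, rank words, keep the best) can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it is hypothetical: the two 3x3 filters, the `recognize` callable standing in for an OCR engine, and the scoring rule (recognizer confidence plus a dictionary bonus) are assumptions chosen for illustration, since the abstract does not specify the noise filters or the "predefined set of parameters". The sketch also assumes each text copy contains the same number of words in the same order, and it omits the pre-processing step.

```python
from typing import Callable, List, Sequence, Set, Tuple

Image = List[List[int]]   # grayscale pixels, 0-255 (assumed representation)
Word = Tuple[str, float]  # (recognized text, recognizer confidence in [0, 1])


def _filter3x3(img: Image, reduce: Callable[[List[int]], int]) -> Image:
    """Apply a 3x3 neighborhood reduction; border pixels are copied unchanged."""
    out = [row[:] for row in img]
    for y in range(1, len(img) - 1):
        for x in range(1, len(img[0]) - 1):
            window = [img[y + dy][x + dx] for dy in (-1, 0, 1) for dx in (-1, 0, 1)]
            out[y][x] = reduce(window)
    return out


def median_filter(img: Image) -> Image:
    """Hypothetical noise filter 1: 3x3 median."""
    return _filter3x3(img, lambda w: sorted(w)[4])


def mean_filter(img: Image) -> Image:
    """Hypothetical noise filter 2: 3x3 mean (integer division)."""
    return _filter3x3(img, lambda w: sum(w) // 9)


def rank_word(word: Word, dictionary: Set[str]) -> float:
    """Assumed ranking rule: confidence plus a bonus for dictionary words."""
    text, confidence = word
    return confidence + (0.5 if text.lower() in dictionary else 0.0)


def extract_text(
    image: Image,
    filters: Sequence[Callable[[Image], Image]],
    recognize: Callable[[Image], List[Word]],
    dictionary: Set[str],
) -> str:
    # One noise-removed image per filter, each recognized into its own text copy.
    text_copies = [recognize(f(image)) for f in filters]
    # For each word position, keep the highest-ranked candidate across copies.
    best = [
        max(candidates, key=lambda w: rank_word(w, dictionary))[0]
        for candidates in zip(*text_copies)
    ]
    return " ".join(best)
```

In practice `recognize` would wrap a real OCR engine, and the word alignment across text copies would need more care than the position-by-position `zip` used here.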

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.