Patent · US Expired

System and method for data extraction from digital images

US6400845B1 · kind B1 · utility

139Cited by
16References
13Claims
0Family size

Assignee

Inventor

Key dates

Filing dateApr 23, 1999
Grant dateJun 4, 2002
Priority date
Expiry dateApr 23, 2019

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99931
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method of the extraction of textual data from a digital image using a data pattern comprised of visible and invisible characters to locate the data to be extracted and upon find such data populating the fields of an associated data base with the extracted visible data. The digital image to be processed is first compared against master document images contained in a database. Upon determining the proper master document image, a template having predefined data zone is applied to the image to create zone images. The zone images are optically read and converted into a character file which is then parsed with the pattern to locate the text to be extracted. Upon finding data matching the pattern, that data is extracted and the visible portions used to populate data fields in a database record associated with the digital image.In an alternate embodiment, if the extracted data cannot be successfully matched, a validation file of the unmatched data is created for review by an operator. In a further embodiment, if the scanned digital image cannot be matched with an existing master document image, a new master document image can be created from the unmatched digital image. In ano…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.