Patent · US Active

Finding repeated structure for data extraction from document images

US8625886B2 · kind B2 · utility

Cited by: 3
References: 0
Claims: 19
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 8, 2011
Grant date: Jan 7, 2014
Priority date
Expiry date: Aug 28, 2031

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/40
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods, and systems employing the same, for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records are selected from the candidate fields using a probabilistic model, and an optimal record set is determined from the best candidate records.
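The pipeline in the abstract (reference record, candidate fields, probabilistic scoring, record-set selection) can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not specify the model or the optimization, so everything here is an assumption made for illustration. The `Box` type, the helper names, the left-edge alignment test for candidates, the isotropic Gaussian layout score, the `x_tol`, `sigma` and `min_score` values, and the greedy non-overlapping selection (standing in for the "optimal record set" step) are all hypothetical.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Box:
    """A text region on the page (hypothetical type for this sketch)."""
    x: float  # left edge
    y: float  # top edge
    text: str


def candidate_fields(ref_field, boxes, x_tol=5.0):
    """Candidates for one reference field: other boxes whose left edge
    roughly aligns with it (an assumed, deliberately simple heuristic)."""
    return [b for b in boxes
            if b != ref_field and abs(b.x - ref_field.x) <= x_tol]


def log_score(record, reference, sigma=3.0):
    """Gaussian log-likelihood, up to a constant, that the record's field
    offsets from its first field match those of the reference record."""
    anchor, ref_anchor = record[0], reference[0]
    total = 0.0
    for field, ref in zip(record[1:], reference[1:]):
        dx = (field.x - anchor.x) - (ref.x - ref_anchor.x)
        dy = (field.y - anchor.y) - (ref.y - ref_anchor.y)
        total -= (dx * dx + dy * dy) / (2 * sigma * sigma)
    return total


def find_records(reference, boxes, min_score=-2.0):
    """Score every combination of candidate fields, keep the plausible
    ones, then greedily pick records that share no box."""
    per_field = [candidate_fields(f, boxes) for f in reference]
    scored = [(log_score(rec, reference), rec) for rec in product(*per_field)]
    scored = [s for s in scored if s[0] >= min_score]
    scored.sort(key=lambda s: s[0], reverse=True)  # best layout match first

    chosen, used = [], set()
    for _, rec in scored:
        if used.isdisjoint(rec):  # no field may belong to two records
            chosen.append(rec)
            used.update(rec)
    return sorted(chosen, key=lambda rec: rec[0].y)  # top-to-bottom order
```

Given a reference record marked on the first row of a table, `find_records(reference, boxes)` returns the other rows whose fields sit at the same relative offsets. The exhaustive product over candidates is exponential in the number of fields, so a practical system would need a more careful search.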

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.