Patent · US Active

Scalable, flexible and robust template-based data extraction pipeline

US11657631B2 · kind B2 · utility

4Cited by
3References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 28, 2021
Grant dateMay 23, 2023
Priority date
Expiry dateNov 13, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer-implemented method for extracting information from a document, for example an official document, is disclosed. The method comprises acquiring an input image comprising a document portion; performing image segmentation on the input image to form a binary input image that distinguishes the document portion from the remaining portion of the input image; estimating a first image transform to align the binary input image to a binary template image, using the first image transform on the input image to form an intermediate image; estimating a second image transform to align the intermediate image to a template image; using the second image transform on the intermediate image to form an output image; and extracting a field from the output image using a predetermined field of the template image.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.