Patent · US Active

Automatic key/value pair extraction from document images using deep learning

US10896357B1 · kind B1 · utility

15Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 29, 2017
Grant dateJan 19, 2021
Priority date
Expiry dateFeb 19, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Key/Value pairs, each comprising a keyword string and an associated value, are extracted automatically from a document image. Each document image has a plurality of pixels with each pixel having a plurality of bits. A first subset of the plurality of bits for each pixel represents information corresponding to the document image. The document image is processed to add information to a second subset of the plurality of bits for each pixel. The information added to the second subset alters the appearance of the document image in a manner that facilitates semantic recognition of textually encoded segments within the document image by a Deep Neural Network (DNN) trained to recognize images within image documents. The DNN detects groupings of text segments within detected spatial templates within the document image. The text segments are mapped to known string values to generate the keyword strings and associated values.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.