Patent · US Active

Automated form understanding via layout agnostic identification of keys and corresponding values

US10878234B1 · kind B1 · utility

11Cited by
0References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 20, 2018
Grant dateDec 29, 2020
Priority date
Expiry dateFeb 6, 2039

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L67/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for automated form understanding via layout-agnostic identification of keys and corresponding values are described. An embedding generator creates embeddings of pixels from an image including a representation of a form. The generated embeddings are similar for pixels within a same key-value unit, and far apart for pixels not in a same key-value unit. A weighted bipartite graph is constructed including a first set of nodes corresponding to keys of the form and a second set of nodes corresponding to values of the form. Weights for the edges are determined based on an analysis of distances between ones of the embeddings. The graph is partitioned according to a scheme to identify pairings between the first set of nodes and the second set of nodes that produces a minimum overall edge weight. The pairings indicate keys and values that are associated within the form.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.