Patent · US Active

Identifying key-value pairs in documents

US11288719B2 · kind B2 · utility

Cited by: 4
References: 3
Claims: 19
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 27, 2020
Grant date: Mar 29, 2022
Priority date
Expiry date: Jun 24, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/414
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.
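
The flow described in the abstract (detect bounding boxes, run OCR on each box, then decide whether the recognized text is a key-value pair) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the `detect` and `ocr` callables stand in for the detection model and the optical character recognition technique, and the colon-based check in `parse_key_value` is a hypothetical rule, not the test claimed in the patent.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# (left, top, right, bottom) in pixels; a hypothetical box convention.
BoundingBox = Tuple[int, int, int, int]


@dataclass
class KeyValuePair:
    key: str
    value: str
    box: BoundingBox


def parse_key_value(text: str) -> Optional[Tuple[str, str]]:
    """Hypothetical check: accept text shaped like 'key: value'.

    Returns (key, value) when both sides of the first colon are
    non-empty, otherwise None.
    """
    key, sep, value = text.partition(":")
    key, value = key.strip(), value.strip()
    if sep and key and value:
        return key, value
    return None


def extract_key_value_pairs(
    image,
    detect: Callable[[object], List[BoundingBox]],
    ocr: Callable[[object, BoundingBox], str],
) -> List[KeyValuePair]:
    """Run detection, then OCR per box, then the key-value check."""
    pairs = []
    for box in detect(image):
        text = ocr(image, box)
        parsed = parse_key_value(text)
        if parsed is not None:
            pairs.append(KeyValuePair(parsed[0], parsed[1], box))
    return pairs


# Stand-ins for a real detection model and OCR engine. The "image" is
# simply a dict mapping each box to the text it encloses.
def fake_detect(image) -> List[BoundingBox]:
    return list(image.keys())


def fake_ocr(image, box: BoundingBox) -> str:
    return image[box]
```

Passing the model and the OCR step in as callables keeps the sketch independent of any particular library; a box whose text fails the check is dropped, which mirrors the final "determining whether the textual data ... defines a key-value pair" step.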

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.