Identifying key-value pairs in documents
US11816710B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 1, 2022 |
| Grant date | Nov 14, 2023 |
| Priority date | — |
| Expiry date | Mar 1, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/414
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method includes: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair including key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.
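The flow described in the abstract (detect candidate boxes, read the text in each box, then decide whether that text is a key-value pair) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation. The names `BoundingBox`, `VALID_KEYS`, `parse_key_value`, `extract_key_value_pairs`, `fake_detect`, and `fake_read_text` are all hypothetical. The detection model and the OCR step are replaced by toy stand-ins. The abstract does not say how the key-value check is made, so the check used here (a colon separator plus a lookup in a small set of known keys) is only an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class BoundingBox:
    """Axis-aligned box in pixel coordinates (hypothetical representation)."""
    left: int
    top: int
    right: int
    bottom: int


# Assumption: a small vocabulary of acceptable keys. The abstract does not
# specify how the key-value determination is actually performed.
VALID_KEYS = frozenset({"invoice number", "date", "total"})


def parse_key_value(text: str) -> Optional[Tuple[str, str]]:
    """Return (key, value) if `text` looks like a key-value pair, else None."""
    key, separator, value = text.partition(":")
    key, value = key.strip(), value.strip()
    if not separator or not value or key.lower() not in VALID_KEYS:
        return None
    return key, value


def extract_key_value_pairs(image, detect, read_text) -> List[Tuple[str, str]]:
    """Run detection, then text recognition, then the key-value check per box."""
    pairs = []
    for box in detect(image):            # boxes predicted to enclose a pair
        text = read_text(image, box)     # OCR restricted to the box
        pair = parse_key_value(text)     # keep only text that passes the check
        if pair is not None:
            pairs.append(pair)
    return pairs


# Toy stand-ins so the sketch runs without a model or an OCR engine:
# the "image" is a dict mapping each box to the text it encloses.
FAKE_IMAGE = {
    BoundingBox(10, 10, 200, 30): "Invoice Number: 0042",
    BoundingBox(10, 40, 200, 60): "Total: $41.50",
    BoundingBox(10, 70, 200, 90): "Thank you for your business",
}


def fake_detect(image):
    """Stand-in for the detection model: returns every box in the toy image."""
    return list(image)


def fake_read_text(image, box):
    """Stand-in for OCR: looks up the text stored for the box."""
    return image[box]


if __name__ == "__main__":
    print(extract_key_value_pairs(FAKE_IMAGE, fake_detect, fake_read_text))
```

In a real system, `detect` would wrap a trained object-detection model and `read_text` would wrap an OCR engine. Passing both in as callables keeps the sketch independent of any particular library.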
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.