Computer vision based document parsing
US12094231B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 1, 2021 |
| Grant date | Sep 17, 2024 |
| Priority date | — |
| Expiry date | Oct 12, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/414
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for machine learning. One of the methods includes receiving one or more page images from a document; for each page image: providing the page image to a computer vision neural network model, wherein the neural network model is trained for the particular page type and is configured to output predictions of coordinates for one or more regions within the image and corresponding labels for the respective regions; and generating an output data structure associating each labeled region with text content located within the labeled region.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.