Patent · US Active

Computer vision based document parsing

US12094231B1 · kind B1 · utility

0Cited by
4References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 1, 2021
Grant dateSep 17, 2024
Priority date
Expiry dateOct 12, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/414
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for machine learning. One of the methods includes receiving one or more page images from a document; for each page image: providing the page image to a computer vision neural network model, wherein the neural network model is trained for the particular page type and is configured to output predictions of coordinates for one or more regions within the image and corresponding labels for the respective regions; and generating an output data structure associating each labeled region with text content located within the labeled region.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.