Patent · US Active

Using serial machine learning models to extract data from electronic documents

US11594057B1 · kind B1 · utility

Cited by: 0
References: 9
Claims: 21
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 4, 2022
Grant date: Feb 28, 2023
Priority date
Expiry date: May 4, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/18057
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for machine learning. One of the methods includes receiving a document having a plurality of first text strings; extracting the plurality of first text strings from the document; providing the extracted plurality of first text strings to a first machine learning model, wherein the first machine learning model is trained to output a numerical vector representation for each input first text string; providing the output vector representations from the first machine learning model to a second machine learning model, wherein the second machine learning model is trained to output a second text string for each input vector representation; and processing the second text strings to generate an output.
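The serial flow described in the abstract (extract strings, map each string to a vector, map each vector to a second string, then post-process) can be illustrated with a minimal Python sketch. The abstract does not say what the two models are, so both are replaced here by toy stand-ins that are assumptions for illustration only: a hashed character-trigram embedding in place of the first model, and a nearest-prototype lookup in place of the second. All names (`extract_strings`, `embed`, `NearestPrototypeDecoder`, `run_pipeline`, `DIM`) and the sample labels are hypothetical and do not come from the patent.

```python
import math
import zlib

DIM = 64  # hypothetical vector size; the abstract does not specify one


def extract_strings(document: str) -> list[str]:
    """Extract the first text strings (here simply the non-empty lines)."""
    return [line.strip() for line in document.splitlines() if line.strip()]


def embed(text: str) -> list[float]:
    """Stand-in for the first model: hashed character-trigram counts,
    L2-normalised so that a dot product equals cosine similarity."""
    vec = [0.0] * DIM
    padded = f"  {text.lower()} "
    for i in range(len(padded) - 2):
        trigram = padded[i:i + 3].encode("utf-8")
        vec[zlib.crc32(trigram) % DIM] += 1.0  # crc32 is stable across runs
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


class NearestPrototypeDecoder:
    """Stand-in for the second model: returns the label string of the
    prototype whose vector is most similar to the input vector."""

    def __init__(self, examples: dict[str, list[str]]):
        # "Training" here is just embedding a few labelled example strings.
        self.prototypes = [
            (label, embed(text))
            for label, texts in examples.items()
            for text in texts
        ]

    def decode(self, vec: list[float]) -> str:
        best_label, best_score = "", -1.0
        for label, proto in self.prototypes:
            score = sum(a * b for a, b in zip(vec, proto))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def run_pipeline(document: str, decoder: NearestPrototypeDecoder) -> dict[str, list[str]]:
    """Run the two stages in series, then group the results as the output."""
    first_strings = extract_strings(document)
    vectors = [embed(s) for s in first_strings]        # first model
    second_strings = [decoder.decode(v) for v in vectors]  # second model
    output: dict[str, list[str]] = {}
    for source, label in zip(first_strings, second_strings):
        output.setdefault(label, []).append(source)
    return output


if __name__ == "__main__":
    decoder = NearestPrototypeDecoder({
        "invoice_number": ["Invoice No: 1234"],
        "total_due": ["Total due: $56.00"],
    })
    print(run_pipeline("Invoice No: 1234\n\nTotal due: $56.00\n", decoder))
```

The point of the sketch is the data flow rather than the models: the second stage only ever sees the vectors produced by the first, so either stage could be swapped for a trained model without changing `run_pipeline`.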

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.