Patent · US Active

Using serial machine learning models to extract data from electronic documents

US11341354B1 · kind B1 · utility

Cited by: 2
References: 8
Claims: 15
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 30, 2020
Grant date: May 24, 2022
Priority date
Expiry date: Sep 30, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/18057
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for machine learning. One of the methods includes receiving a document having a plurality of first text strings; extracting the plurality of first text strings from the document; providing the extracted plurality of first text strings to a first machine learning model, wherein the first machine learning model is trained to output a numerical vector representation for each input first text string; providing the output vector representations from the first machine learning model to a second machine learning model, wherein the second machine learning model is trained to output a second text string for each input vector representation; and processing the second text strings to generate an output.
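
The serial flow described in the abstract (extract strings, encode each string as a numerical vector, decode each vector into a second string, then post-process) can be illustrated with a minimal sketch. Everything below is a hypothetical stand-in, not the patented implementation: the letter-frequency `encode` function takes the place of the trained first model, the nearest-prototype `decode` function takes the place of the trained second model, and the labels `invoice_id` and `total` are invented for illustration.

```python
import math
from collections import Counter
from typing import Dict, List

Vector = List[float]


def extract_strings(document: str) -> List[str]:
    """Extract the first text strings; here, simply the non-empty lines."""
    return [line.strip() for line in document.splitlines() if line.strip()]


def encode(text: str) -> Vector:
    """Stand-in for the first model: a 26-dim letter-frequency vector."""
    counts = Counter(ch for ch in text.lower() if "a" <= ch <= "z")
    total = sum(counts.values()) or 1  # avoid division by zero
    return [counts.get(chr(ord("a") + i), 0) / total for i in range(26)]


# Hypothetical label prototypes used by the stand-in second model.
PROTOTYPES: Dict[str, Vector] = {
    "invoice_id": encode("invoice number"),
    "total": encode("total amount"),
}


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def decode(vector: Vector) -> str:
    """Stand-in for the second model: return the closest prototype's label."""
    return max(PROTOTYPES, key=lambda label: cosine(vector, PROTOTYPES[label]))


def run_pipeline(document: str) -> Dict[str, int]:
    """Chain the two models in series, then count the second text strings."""
    first_strings = extract_strings(document)
    vectors = [encode(s) for s in first_strings]   # output of model 1
    second_strings = [decode(v) for v in vectors]  # output of model 2
    return dict(Counter(second_strings))           # final processing step


if __name__ == "__main__":
    print(run_pipeline("Invoice Number\n\nTOTAL AMOUNT\n"))
```

The point of the sketch is the hand-off: the second model never sees the raw text, only the vectors produced by the first model, which is what makes the arrangement serial rather than parallel.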

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.