Using serial machine learning models to extract data from electronic documents
US11341354B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 30, 2020 |
| Grant date | May 24, 2022 |
| Priority date | — |
| Expiry date | Sep 30, 2040 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/18057
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for machine learning. One of the methods includes receiving a document having a plurality of first text strings; extracting the plurality of first text strings from the document; providing the extracted plurality of first text strings to a first machine learning model, wherein the first machine learning model is trained to output a numerical vector representation for each input first text string; providing the output vector representations from the first machine learning model to a second machine learning model, wherein the second machine learning model is trained to output a second text string for each input vector representation; and processing the second text strings to generate an output.
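The serial arrangement described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify either model, so both are replaced here with toy stand-ins. The names `extract_text_strings`, `embed`, `NearestPrototypeLabeler`, and `run_pipeline` are hypothetical, the "first model" is assumed to be a hashed character-trigram embedder, and the "second model" is assumed to be a nearest-prototype lookup that returns a label string for each input vector.

```python
import math
from typing import Dict, List

Vector = List[float]


def extract_text_strings(document: str) -> List[str]:
    """Extract the 'first text strings': here, simply the non-empty lines."""
    return [line.strip() for line in document.splitlines() if line.strip()]


def embed(text: str, dim: int = 64) -> Vector:
    """Toy stand-in for the first model (an assumption, not the patent's model).

    Counts character trigrams into `dim` buckets with a deterministic hash,
    then L2-normalizes the result.
    """
    vec = [0.0] * dim
    padded = f"  {text.lower()} "
    for i in range(len(padded) - 2):
        trigram = padded[i:i + 3]
        bucket = sum(ord(c) * (k + 1) for k, c in enumerate(trigram)) % dim
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


class NearestPrototypeLabeler:
    """Toy stand-in for the second model (an assumption, not the patent's model).

    Maps each input vector to the label whose prototype vector has the
    highest dot product with it; the label is the 'second text string'.
    """

    def __init__(self, labeled_examples: Dict[str, str]) -> None:
        # One example text per label; its embedding serves as the prototype.
        self.prototypes = {
            label: embed(text) for label, text in labeled_examples.items()
        }

    def predict(self, vector: Vector) -> str:
        def similarity(label: str) -> float:
            return sum(a * b for a, b in zip(vector, self.prototypes[label]))

        return max(self.prototypes, key=similarity)


def run_pipeline(document: str, labeler: NearestPrototypeLabeler) -> List[dict]:
    """Chain the steps: extract strings, embed them, label them, build output."""
    strings = extract_text_strings(document)
    vectors = [embed(s) for s in strings]            # first model
    labels = [labeler.predict(v) for v in vectors]   # second model
    # Process the second text strings into a structured output.
    return [{"text": s, "label": l} for s, l in zip(strings, labels)]


if __name__ == "__main__":
    labeler = NearestPrototypeLabeler({
        "invoice_number": "Invoice Number: 12345",
        "total": "Total Due: $100.00",
    })
    for record in run_pipeline("Invoice Number: 12345\nTotal Due: $100.00", labeler):
        print(record)
```

The point of the sketch is the data flow rather than the models: the second stage never sees raw text, only the vectors produced by the first stage, so either stage could be swapped for a trained model without changing `run_pipeline`.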
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.