Generating vector representations of documents
US10803380B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 12, 2016 |
| Grant date | Oct 13, 2020 |
| Priority date | — |
| Expiry date | Jul 4, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating document vector representations. One of the methods includes obtaining a new document; selecting a plurality of new document word sets; and determining a vector representation for the new document using a trained neural network system, wherein the trained neural network system comprises: a document embedding layer and a classifier, and wherein determining the vector representation for the new document using the trained neural network system comprises iteratively providing each of the plurality of new document word sets to the trained neural network system to determine the vector representation for the new document using gradient descent.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.