Generating vector representations of documents
US10366327B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Jan 30, 2015 |
| Grant date | Jul 30, 2019 |
| Priority date | — |
| Expiry date | Feb 14, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N3/0895
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating document vector representations. One of the methods includes obtaining a new document; and determining a vector representation for the new document using a trained neural network system, wherein the trained neural network system has been trained to receive an input document and a sequence of words from the input document and to generate a respective word score for each word in a set of words, wherein each of the respective word scores represents a predicted likelihood that the corresponding word follows a last word in the sequence in the input document, and wherein determining the vector representation for the new document using the trained neural network system comprises iteratively providing each of the plurality of sequences of words to the trained neural network system to determine the vector representation for the new document using gradient descent.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.