Patent · US Active

Generating vector representations of documents

US10366327B2 · kind B2 · utility

Cited by: 4
References: 0
Claims: 18
Family size: 0

Assignee

Inventor

Key dates

Filing date: Jan 30, 2015
Grant date: Jul 30, 2019
Priority date
Expiry date: Feb 14, 2036

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/0895
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating document vector representations. One of the methods includes obtaining a new document; and determining a vector representation for the new document using a trained neural network system, wherein the trained neural network system has been trained to receive an input document and a sequence of words from the input document and to generate a respective word score for each word in a set of words, wherein each of the respective word scores represents a predicted likelihood that the corresponding word follows a last word in the sequence in the input document, and wherein determining the vector representation for the new document using the trained neural network system comprises iteratively providing each of the plurality of sequences of words to the trained neural network system to determine the vector representation for the new document using gradient descent.
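The inference step described in the abstract can be illustrated with a minimal sketch: the network weights stay fixed, and only the vector for the new document is adjusted by gradient descent so that the predicted next-word scores match the words that actually follow each sequence. The abstract does not specify the architecture, so everything concrete here is an assumption made for illustration: the function name `infer_document_vector`, the choice to average the context word embeddings and concatenate them with the document vector, the softmax output layer, and the randomly initialized weights standing in for a trained network.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D array of word scores.
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def infer_document_vector(sequences, word_emb, out_w, out_b, dim,
                          steps=50, lr=0.1, seed=0):
    """Fit a vector for a new document while the network stays frozen.

    sequences: list of (context_word_ids, next_word_id) pairs taken
               from the new document (hypothetical input format).
    word_emb:  (vocab, dim) word embeddings, not updated here.
    out_w:     (vocab, 2 * dim) output weights, not updated here.
    out_b:     (vocab,) output bias, not updated here.
    """
    rng = np.random.default_rng(seed)
    doc_vec = rng.normal(scale=0.01, size=dim)
    losses = []
    for _ in range(steps):
        total_loss = 0.0
        grad = np.zeros(dim)
        for context_ids, next_id in sequences:
            # Assumed combiner: document vector + mean context embedding.
            ctx = word_emb[context_ids].mean(axis=0)
            hidden = np.concatenate([doc_vec, ctx])
            probs = softmax(out_w @ hidden + out_b)
            total_loss += -np.log(probs[next_id])
            # Gradient of the cross-entropy loss w.r.t. the word scores.
            dscores = probs.copy()
            dscores[next_id] -= 1.0
            # Back-propagate only into the document-vector slice.
            grad += out_w[:, :dim].T @ dscores
        losses.append(total_loss / len(sequences))
        # Gradient descent step on the document vector alone.
        doc_vec = doc_vec - lr * grad / len(sequences)
    return doc_vec, losses

# Stand-in "trained" parameters (random here, purely for illustration).
rng = np.random.default_rng(42)
vocab_size, dim = 6, 4
word_emb = rng.normal(scale=0.5, size=(vocab_size, dim))
out_w = rng.normal(scale=0.5, size=(vocab_size, 2 * dim))
out_b = np.zeros(vocab_size)

# Word-id sequences from a hypothetical new document: (context, next word).
sequences = [([0, 1], 2), ([1, 2], 3), ([2, 3], 4)]

doc_vec, losses = infer_document_vector(sequences, word_emb, out_w, out_b, dim)
print("mean loss before:", losses[0], "after:", losses[-1])
```

Because only `doc_vec` receives updates, the resulting vector is a property of the new document under the fixed network, which is what lets it serve as that document's representation.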

Source: USPTO / EPO open patent data (bibliographic data and citation counts).