Patent · US Active

Generating and using joint representations of source code

US11169786B2 · kind B2 · utility

Cited by: 3
References: 2
Claims: 12
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 4, 2020
Grant date: Nov 9, 2021
Priority date
Expiry date: Feb 4, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/044
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Implementations are described herein for generating embeddings of source code using both the language and graph domains, and leveraging combinations of these semantically-rich and structurally-informative embeddings for various purposes. In various implementations, tokens of a source code snippet may be applied as input across a sequence-processing machine learning model to generate a plurality of token embeddings. A graph may also be generated based on the source code snippet. A joint representation may be generated based on the graph and the incorporated token embeddings. The joint representation generated from the source code snippet may be compared to one or more other joint representations generated from one or more other source code snippets to make a determination about the source code snippet.
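
The pipeline in the abstract (token embeddings, a graph of the snippet, a joint representation, and a comparison between snippets) can be illustrated with a minimal sketch. It is not the patented implementation, and every name in it is hypothetical. It assumes the following stand-ins: hash-derived vectors in place of learned embeddings, a running average in place of a trained sequence-processing model, Python's abstract syntax tree as the graph, a few rounds of mean aggregation in place of a trained graph model, and cosine similarity as the comparison. Token-to-node matching by (line, column) assumes ASCII source.

```python
import ast
import hashlib
import io
import math
import tokenize

DIM = 16  # embedding width, chosen arbitrarily for this sketch
SKIP = {tokenize.NEWLINE, tokenize.NL, tokenize.INDENT, tokenize.DEDENT,
        tokenize.COMMENT, tokenize.ENDMARKER}


def toy_vector(text):
    """Deterministic pseudo-embedding from a hash; a stand-in for learned weights."""
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [b / 127.5 - 1.0 for b in digest[:DIM]]


def token_embeddings(source):
    """Stand-in sequence model: a running average gives each token left context."""
    state, out = [0.0] * DIM, {}
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type in SKIP:
            continue
        state = [0.5 * s + 0.5 * v for s, v in zip(state, toy_vector(tok.string))]
        out[tok.start] = state  # keyed by the token's (line, column) start
    return out


def build_graph(source):
    """Flatten the AST into node labels, node positions, and parent-child edges."""
    labels, positions, edges = [], [], []

    def visit(node, parent):
        index = len(labels)
        labels.append(type(node).__name__)
        positions.append((getattr(node, "lineno", None),
                          getattr(node, "col_offset", None)))
        if parent is not None:
            edges.append((parent, index))
        for child in ast.iter_child_nodes(node):
            visit(child, index)

    visit(ast.parse(source), None)
    return labels, positions, edges


def joint_representation(source, rounds=2):
    """Seed graph nodes with token embeddings, propagate, then mean-pool."""
    tokens = token_embeddings(source)
    labels, positions, edges = build_graph(source)
    feats = []
    for label, pos in zip(labels, positions):
        vec = toy_vector(label)
        if pos in tokens:  # incorporate the token embedding starting at this node
            vec = [a + b for a, b in zip(vec, tokens[pos])]
        feats.append(vec)
    neighbors = [[i] for i in range(len(feats))]  # each node keeps its own state
    for parent, child in edges:
        neighbors[parent].append(child)
        neighbors[child].append(parent)
    for _ in range(rounds):  # simple mean-aggregation message passing
        feats = [[sum(feats[j][d] for j in nbrs) / len(nbrs) for d in range(DIM)]
                 for nbrs in neighbors]
    return [sum(f[d] for f in feats) / len(feats) for d in range(DIM)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


snippet_a = "def add(a, b):\n    return a + b\n"
snippet_b = "def add(a, b):\n    return a + b\n"
snippet_c = "for i in range(10):\n    print(i)\n"
sim_same = cosine(joint_representation(snippet_a), joint_representation(snippet_b))
sim_diff = cosine(joint_representation(snippet_a), joint_representation(snippet_c))
print(round(sim_same, 6), round(sim_diff, 6))
```

Because the vectors here are hash-derived rather than learned, identical snippets score a similarity of 1 (up to rounding), but the score for different snippets carries no semantic meaning. A trained sequence model and graph model would be needed for the comparison to support a real determination about a snippet.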

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.