Patent · US Active

Systems, methods, and graphical user interfaces for training a code generation model for low-resource languages

US12277409B1 · kind B1 · utility

2Cited by
0References
30Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 24, 2024
Grant dateApr 15, 2025
Priority date
Expiry dateSep 24, 2044

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F11/3612
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system, method, and computer-program product includes identifying a plurality of code synthesis items for a target programming language, generating a code synthesis prompt based on a first sampling of the plurality of code synthesis items, synthesizing, via a large language model, a plurality of raw code segments using the code synthesis prompt, executing the plurality of raw code segments with a code interpreter associated with the target programming language, determining one or more valid code segments of the plurality of raw code segments that the code interpreter successfully executed, aggregating, via a second sampling, the one or more valid code segments into one or more validated code synthesis training samples, and training a code generation model using the one or more validated code synthesis training samples. User interfaces may be provided to allow target coding tasks to be specified via text or speech.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.