Systems, methods, and graphical user interfaces for training a code generation model for low-resource languages
US12277409B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 24, 2024 |
| Grant date | Apr 15, 2025 |
| Priority date | — |
| Expiry date | Sep 24, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F11/3612
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system, method, and computer-program product includes identifying a plurality of code synthesis items for a target programming language, generating a code synthesis prompt based on a first sampling of the plurality of code synthesis items, synthesizing, via a large language model, a plurality of raw code segments using the code synthesis prompt, executing the plurality of raw code segments with a code interpreter associated with the target programming language, determining one or more valid code segments of the plurality of raw code segments that the code interpreter successfully executed, aggregating, via a second sampling, the one or more valid code segments into one or more validated code synthesis training samples, and training a code generation model using the one or more validated code synthesis training samples. User interfaces may be provided to allow target coding tasks to be specified via text or speech.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.