Fast decoding in sequence models using discrete latent variables
US12353991B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Feb 11, 2019 |
| Grant date | Jul 8, 2025 |
| Priority date | — |
| Expiry date | May 3, 2041 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N3/098
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating an output sequence from an input sequence. One of the methods includes receiving the input sequence; processing the input sequence using a latent prediction model configured to autoregressively predict a sequence of discrete latent variables that is shorter than the output sequence and that encodes the output sequence, wherein each discrete latent variable in the sequence is selected from a discrete set of latent variables; and processing the input sequence and the predicted sequence of discrete latent variables using a parallel decoder model configured to generate the outputs in the output sequence in parallel from the input sequence and the predicted sequence of discrete latent variables.
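The two-stage flow in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `toy_latent_step`, `toy_position_decoder`, `CODEBOOK_SIZE`, and `COMPRESSION` are hypothetical, and the two toy functions are arithmetic stand-ins for trained models. The sketch only shows the control flow: a short sequence of discrete latents is predicted one step at a time, and each output position is then computed from the input and the latents alone, so the positions do not depend on one another.

```python
from typing import Callable, List, Sequence, Tuple

CODEBOOK_SIZE = 8  # hypothetical size of the discrete set of latent variables
COMPRESSION = 4    # hypothetical ratio: one latent per 4 output positions


def toy_latent_step(inputs: Sequence[int], prefix: Sequence[int]) -> int:
    """Stand-in for one autoregressive step of a latent prediction model.

    The next latent depends on the input and on the latents predicted so far.
    """
    return (sum(inputs) + 3 * sum(prefix) + len(prefix)) % CODEBOOK_SIZE


def toy_position_decoder(inputs: Sequence[int], latents: Sequence[int], pos: int) -> int:
    """Stand-in for a parallel decoder evaluated at a single output position.

    It reads only the input and the latents, never another output position.
    """
    return 10 * latents[pos // COMPRESSION] + inputs[pos % len(inputs)]


def predict_latents(
    inputs: Sequence[int],
    num_latents: int,
    step: Callable[[Sequence[int], Sequence[int]], int] = toy_latent_step,
) -> List[int]:
    """Sequential loop: each latent is conditioned on the previous ones."""
    latents: List[int] = []
    for _ in range(num_latents):
        latents.append(step(inputs, latents))
    return latents


def decode_parallel(
    inputs: Sequence[int],
    latents: Sequence[int],
    output_len: int,
    decode: Callable[[Sequence[int], Sequence[int], int], int] = toy_position_decoder,
) -> List[int]:
    """Every position is independent, so this comprehension could be run
    as one batched call on parallel hardware; here it is a plain loop."""
    return [decode(inputs, latents, pos) for pos in range(output_len)]


def generate(inputs: Sequence[int], output_len: int) -> Tuple[List[int], List[int]]:
    num_latents = -(-output_len // COMPRESSION)  # ceiling division
    latents = predict_latents(inputs, num_latents)
    return latents, decode_parallel(inputs, latents, output_len)


latents, outputs = generate([1, 2, 3], output_len=8)
print(latents, outputs)
```

In this sketch the only sequential work is the loop over `num_latents` steps, which is shorter than the output by the assumed factor `COMPRESSION`; the per-position decoding carries no dependency between outputs.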
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.