Speech synthesis using one or more recurrent neural networks
US11069335B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 12, 2017 |
| Grant date | Jul 20, 2021 |
| Priority date | — |
| Expiry date | Jul 12, 2037 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L13/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Aspects of the disclosure are related to synthesizing speech or other audio based on input data. Additionally, aspects of the disclosure are related to using one or more recurrent neural networks. For example, a computing device may receive text input; may determine features based on the text input; may provide the features as input to an recurrent neural network; may determine embedded data from one or more activations of a hidden layer of the recurrent neural network; may determine speech data based on a speech unit search that attempts to select, from a database, speech units based on the embedded data; and may generate speech output based on the speech data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.