Patent · US Active

Speech synthesis using one or more recurrent neural networks

US11069335B2 · kind B2 · utility

7Cited by
8References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 12, 2017
Grant dateJul 20, 2021
Priority date
Expiry dateJul 12, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L13/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Aspects of the disclosure are related to synthesizing speech or other audio based on input data. Additionally, aspects of the disclosure are related to using one or more recurrent neural networks. For example, a computing device may receive text input; may determine features based on the text input; may provide the features as input to an recurrent neural network; may determine embedded data from one or more activations of a hidden layer of the recurrent neural network; may determine speech data based on a speech unit search that attempts to select, from a database, speech units based on the embedded data; and may generate speech output based on the speech data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.