Patent · US Active

End-to-end text-to-speech conversion

US10573293B2 · kind B2 · utility

3Cited by

1References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Samuel Bengio · Los Altos, US
Yuxuan Wang · 安丰镇, CN
Zongheng Yang · Berkeley, US
Zhifeng Chen · Sunnyvale, US
Yonghui Wu · Fremont, US
Ioannis Agiomyrgiannakis · London, GB
Ron J. Weiss · New York, US
Navdeep Jaitly · Mountain View, US
Ryan M. Rifkin · Oakland, US
Robert Andrew James Clark · Stapleford, GB
Quoc V. Le · Stanford, US
Russell J. Ryan · Mountain View, US
Ying Xiao · San Bruno, US

Key dates

Filing date	Jun 20, 2019
Grant date	Feb 25, 2020
Priority date	—
Expiry date	Jun 20, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.