Patent · US Active

Learnable speed control for speech synthesis

US11302301B2 · kind B2 · utility

Cited by	2

References	5

Claims	16

Family size	0

Assignee

TENCENT AMERICA LLC · US

Inventors

Chengzhu Yu · Bellevue, US
Dong Yu · Bellevue, US

Key dates

Filing date	Mar 3, 2020
Grant date	Apr 12, 2022
Priority date	—
Expiry date	Apr 29, 2040

Classification

Technology area (CPC G)	Physics
CPC primary	G06N3/088
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method, computer program, and computer system are provided for synthesizing speech at one or more speeds. A context associated with one or more phonemes corresponding to a speaking voice is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes and target acoustic frames, and a voice sample corresponding to the speaking voice is synthesized using the generated mel-spectrogram features.
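
Illustrative sketch (not taken from the patent): the abstract's alignment step maps each phoneme to a span of target acoustic frames, and the speaking speed determines how many frames each phoneme receives. The Python below shows one simple way such an alignment could work, assuming a per-phoneme duration in frames is already available and that speed acts as a plain divisor on those durations. The function name align_phonemes_to_frames, the durations input, and the speed parameter are all hypothetical; the patent describes a learned alignment based on the encoded context, which this fixed-rule expansion does not reproduce.

```python
import math
from typing import List, Sequence


def align_phonemes_to_frames(
    phonemes: Sequence[str],
    durations: Sequence[float],
    speed: float = 1.0,
) -> List[str]:
    """Expand phonemes into a frame-level sequence.

    Assumption (not from the patent): each phoneme has a known duration
    in frames, and a speed factor > 1 shortens it while a factor < 1
    lengthens it. Every phoneme keeps at least one frame.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    if len(phonemes) != len(durations):
        raise ValueError("phonemes and durations must have the same length")

    frames: List[str] = []
    for phoneme, duration in zip(phonemes, durations):
        # Round half up to the nearest whole frame, never below one frame.
        n_frames = max(1, math.floor(duration / speed + 0.5))
        frames.extend([phoneme] * n_frames)
    return frames


if __name__ == "__main__":
    phonemes = ["HH", "AH", "L", "OW"]
    durations = [2, 4, 2, 6]  # hypothetical frame counts at normal speed
    for speed in (0.5, 1.0, 2.0):
        aligned = align_phonemes_to_frames(phonemes, durations, speed)
        print(speed, len(aligned), aligned)
```

In a full system, the frame-level sequence produced here would condition a decoder that generates mel-spectrogram features frame by frame, and a vocoder would turn those features into audio. Both stages are outside the scope of this sketch.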

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.