Patent · US Active

Speech synthesis method and apparatus, and readable storage medium

US12033612B2 · kind B2 · utility

1Cited by

0References

20Claims

0Family size

Assignee

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED · CN

Inventors

Yibin ZHENG · Hartland, US
Xinhui Li · Beijing, CN
Li Lu · Lo Wu, CN

Key dates

Filing date	Nov 10, 2022
Grant date	Jul 9, 2024
Priority date	—
Expiry date	Nov 10, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/08
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speech synthesis method includes: converting a text input sequence into a text feature representation sequence; inputting the text feature representation sequence into an encoder including N encoding layers; the N encoding layers including an encoding layer Ei and an encoding layer Ei+1; the encoding layer Ei+1 including a first multi-head self-attention network; acquiring a first attention matrix and a historical text encoded sequence outputted by the encoding layer Ei, and generating a second attention matrix of the encoding layer Ei+1 according to residual connection between the first attention matrix and the first multi-head self-attention network and the historical text encoded sequence; and generating a target text encoded sequence of the encoding layer Ei+1 according to the second attention matrix and the historical text encoded sequence, and generating synthesized speech data matched with the text input sequence based on the target text encoded sequence.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.