Patent · US Active

Method and apparatus for speech synthesis

US10553201B2 · kind B2 · utility

Cited by	0

References	0

Claims	13

Family size	0

Assignee

BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD. · CN

Inventor

Zhiping Zhou · Marietta, US

Key dates

Filing date	Sep 18, 2018
Grant date	Feb 4, 2020
Priority date	—
Expiry date	Sep 18, 2038

Classification

Technology area (CPC G)	Physics
CPC primary	G10L13/06
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method of speech synthesis is provided, which comprises: determining a phoneme sequence of a to-be-processed text; inputting the phoneme sequence into a pre-trained speech model to obtain an acoustic characteristic corresponding to each phoneme in the phoneme sequence, where the speech model is used for characterizing a corresponding relationship between each phoneme in the phoneme sequence and the acoustic characteristic; determining, for each phoneme in the phoneme sequence, at least one speech waveform unit corresponding to each phoneme based on a preset index of phonemes and speech waveform units, and determining a target speech waveform unit of the at least one speech waveform unit based on the acoustic characteristic corresponding to the phoneme and a preset cost function; and synthesizing the target speech waveform unit corresponding to each phoneme in the phoneme sequence to generate a speech.
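The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation. The lexicon, the stand-in "speech model", the two-value feature tuples (pitch, duration), the unit index contents, and the cost function (a squared-distance target cost plus a weighted join cost, selected greedily) are all hypothetical choices made for illustration. The abstract only states that a pre-trained model, a preset index, and a preset cost function are used.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Features = Tuple[float, ...]  # hypothetical acoustic features, e.g. (pitch_hz, duration_ms)


@dataclass(frozen=True)
class WaveformUnit:
    unit_id: str
    features: Features           # acoustic features measured on the recorded unit
    samples: Tuple[float, ...]   # waveform samples of the recorded unit


# Hypothetical toy lexicon standing in for "determining a phoneme sequence".
LEXICON: Dict[str, List[str]] = {"hi": ["HH", "AY"]}


def text_to_phonemes(text: str) -> List[str]:
    return [p for word in text.lower().split() for p in LEXICON[word]]


# Hypothetical stand-in for the pre-trained speech model: a lookup table
# that returns one feature tuple per phoneme in the sequence.
TOY_PREDICTIONS: Dict[str, Features] = {"HH": (120.0, 60.0), "AY": (140.0, 180.0)}


def toy_speech_model(phonemes: Sequence[str]) -> List[Features]:
    return [TOY_PREDICTIONS[p] for p in phonemes]


# Hypothetical preset index from phonemes to candidate waveform units.
UNIT_INDEX: Dict[str, List[WaveformUnit]] = {
    "HH": [
        WaveformUnit("HH_1", (118.0, 62.0), (0.1, 0.2)),
        WaveformUnit("HH_2", (200.0, 40.0), (0.3, 0.4)),
    ],
    "AY": [
        WaveformUnit("AY_1", (90.0, 100.0), (0.5,)),
        WaveformUnit("AY_2", (141.0, 175.0), (0.6, 0.7)),
    ],
}


def squared_distance(a: Features, b: Features) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cost(target: Features, candidate: WaveformUnit,
         previous: Optional[WaveformUnit], join_weight: float = 0.1) -> float:
    # Assumed cost: mismatch with the predicted features, plus a penalty
    # for an abrupt change relative to the previously selected unit.
    target_cost = squared_distance(target, candidate.features)
    join_cost = 0.0 if previous is None else squared_distance(
        previous.features, candidate.features)
    return target_cost + join_weight * join_cost


def synthesize(phonemes: Sequence[str],
               speech_model: Callable[[Sequence[str]], List[Features]],
               unit_index: Dict[str, List[WaveformUnit]]):
    targets = speech_model(phonemes)
    selected: List[WaveformUnit] = []
    previous: Optional[WaveformUnit] = None
    for phoneme, target in zip(phonemes, targets):
        # Look up the candidates for this phoneme and keep the cheapest one.
        best = min(unit_index[phoneme], key=lambda u: cost(target, u, previous))
        selected.append(best)
        previous = best
    # "Synthesizing" here is plain concatenation of the selected samples.
    waveform = [s for unit in selected for s in unit.samples]
    return selected, waveform


if __name__ == "__main__":
    units, wave = synthesize(text_to_phonemes("hi"), toy_speech_model, UNIT_INDEX)
    print([u.unit_id for u in units], wave)
```

The sketch selects units greedily from left to right. A search over the whole sequence would be a different design choice and is not shown here.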

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.