Method and apparatus for speech synthesis
US10553201B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Sep 18, 2018 |
| Grant date | Feb 4, 2020 |
| Priority date | — |
| Expiry date | Sep 18, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L13/06
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method of speech synthesis is provided, which comprises: determining a phoneme sequence of a to-be-processed text; inputting the phoneme sequence into a pre-trained speech model to obtain an acoustic characteristic corresponding to each phoneme in the phoneme sequence, where the speech model is used for characterizing a corresponding relationship between each phoneme in the phoneme sequence and the acoustic characteristic; determining, for each phoneme in the phoneme sequence, at least one speech waveform unit corresponding to each phoneme based on a preset index of phonemes and speech waveform units, and determining a target speech waveform unit of the at least one speech waveform unit based on the acoustic characteristic corresponding to the phoneme and a preset cost function; and synthesizing the target speech waveform unit corresponding to each phoneme in the phoneme sequence to generate a speech.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.