Patent · US Active

Speech synthesis with fuzzy heteronym prediction using decision trees

US9058811B2 · kind B2 · utility

240 Cited by

12 References

10 Claims

0 Family size

Assignee

Kabushiki Kaisha Toshiba · JP

Inventors

Xi Wang · Singapore, SG
Xiaoyan Lou · Beijing, CN
Jian-Feng Li · Blacksburg, US

Key dates

Filing date	Feb 22, 2012
Grant date	Jun 16, 2015
Priority date	—
Expiry date	Jul 1, 2033

Classification

Technology area (CPC G)	Physics
CPC primary	G10L13/08
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

According to one embodiment, a method and apparatus for synthesizing speech, and a method for training an acoustic model used in speech synthesis, are provided. The method for synthesizing speech may include determining data generated by text analysis as fuzzy heteronym data, performing fuzzy heteronym prediction on the fuzzy heteronym data to output a plurality of candidate pronunciations of the fuzzy heteronym data and probabilities thereof, generating fuzzy context feature labels based on the plurality of candidate pronunciations and probabilities thereof, determining model parameters for the fuzzy context feature labels based on an acoustic model with a fuzzy decision tree, generating speech parameters from the model parameters, and synthesizing the speech parameters via a synthesizer as speech.
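
The middle steps of the abstract (candidate pronunciations with probabilities, a fuzzy context feature label, then model parameters) can be illustrated with a minimal Python sketch. This is one plausible reading and not the patented method: the abstract does not say how the fuzzy decision tree combines parameters, so the probability-weighted average used here is an assumption. The names `Candidate`, `LEAF_MEANS`, `fuzzy_label`, and `blend_parameters`, the pronunciation strings, and the numeric values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    pronunciation: str  # one candidate reading of the heteronym
    probability: float  # predictor's score for that reading


# Hypothetical stand-in for decision-tree leaves: one mean vector per reading.
# A real acoustic model would reach these by answering context questions.
LEAF_MEANS = {
    "chang2": [1.0, 4.0],
    "zhang3": [3.0, 0.0],
}


def fuzzy_label(candidates):
    """Build a fuzzy label: (pronunciation, weight) pairs whose weights sum to 1."""
    total = sum(c.probability for c in candidates)
    if total <= 0:
        raise ValueError("candidate probabilities must sum to a positive value")
    return [(c.pronunciation, c.probability / total) for c in candidates]


def blend_parameters(label, leaf_means):
    """Assumed combination rule: probability-weighted average of leaf means."""
    dim = len(next(iter(leaf_means.values())))
    blended = [0.0] * dim
    for pronunciation, weight in label:
        for i, value in enumerate(leaf_means[pronunciation]):
            blended[i] += weight * value
    return blended


if __name__ == "__main__":
    label = fuzzy_label([Candidate("chang2", 3.0), Candidate("zhang3", 1.0)])
    print(label)
    print(blend_parameters(label, LEAF_MEANS))
```

The point of the sketch is that an uncertain heteronym is not forced into a single reading before parameter selection. Every candidate contributes in proportion to its predicted probability.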

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.