Patent · US Expired

Text-to-speech using clustered context-dependent phoneme-based units

US6163769A · kind A · utility

235Cited by

6References

31Claims

0Family size

Assignee

Microsoft Corporation · US

Inventors

Alejandro Acero · Bellevue, US
Hsiao-Wuen Hon · Saratoga, US
Xuedong Huang · Bellevue, US

Key dates

Filing date	Oct 2, 1997
Grant date	Dec 19, 2000
Priority date	—
Expiry date	Oct 2, 2017

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/07
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A text-to-speech system includes a storage device for storing a clustered set of context-dependent phoneme-based units of a target speaker. In one embodiment, decision trees are used wherein each decision tree based context-dependent phoneme-based unit is arranged based on context of at least one immediately preceding and succeeding phoneme. At least one of the context-dependent phoneme-based units represents other non-stored context-dependent phoneme units of similar sound due to similar contexts. A text analyzer obtains a string of phonetic symbols representative of text to be converted to speech. A concatenation module selects stored decision tree based context-dependent phoneme-based units from the set decision tree based context-dependent phoneme-based units based on the context of the phonetic symbols and synthesizes the selected phoneme-based units to generate speech corresponding to the text.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.