Text-to-speech using clustered context-dependent phoneme-based units
US6163769A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Oct 2, 1997 |
| Grant date | Dec 19, 2000 |
| Priority date | — |
| Expiry date | Oct 2, 2017 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L13/07
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A text-to-speech system includes a storage device for storing a clustered set of context-dependent phoneme-based units of a target speaker. In one embodiment, decision trees are used wherein each decision tree based context-dependent phoneme-based unit is arranged based on context of at least one immediately preceding and succeeding phoneme. At least one of the context-dependent phoneme-based units represents other non-stored context-dependent phoneme units of similar sound due to similar contexts. A text analyzer obtains a string of phonetic symbols representative of text to be converted to speech. A concatenation module selects stored decision tree based context-dependent phoneme-based units from the set decision tree based context-dependent phoneme-based units based on the context of the phonetic symbols and synthesizes the selected phoneme-based units to generate speech corresponding to the text.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.