System and method for correcting errors when generating a TTS voice
US7742921B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 27, 2005 |
| Grant date | Jun 22, 2010 |
| Priority date | — |
| Expiry date | Feb 5, 2029 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L13/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Disclosed herein are various innovations associated with a toolkit used for generating a TTS voice for use in a spoken dialog system. The inventions in each case may be in the form of the system, a computer-readable medium or a method for generating the TTS voice. An embodiment of the invention relates to a method of enabling human workers to find errors when developing a text-to-speech (TTS) voice. The method comprises presenting a graphical user interface wherein after a first pass of automatic speech recognition (ASR) of a speech corpus is complete, the interface presents to a worker a graphical representation of an alignment of the ASR results, associated words and phonemes and the audio, receiving a graphical input from the worker associated with a selection of a word or phoneme and presenting the audio associated with the selected word or phoneme.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.