Patent · US Active

System and method for correcting errors when generating a TTS voice

US7742921B1 · kind B1 · utility

16Cited by

15References

18Claims

0Family size

Assignee

AT&T Intellectual Property I, L.P. · US

Inventors

Steven Lawrence Davis · Madelia, US
Shane Fetters · St. Peter, US
David Eugene Schulz · Wheaton, US
Beverly Gustafson · St. Peter, US
Louise Loney · Elysian, US

Key dates

Filing date	Sep 27, 2005
Grant date	Jun 22, 2010
Priority date	—
Expiry date	Feb 5, 2029

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/00
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed herein are various innovations associated with a toolkit used for generating a TTS voice for use in a spoken dialog system. The inventions in each case may be in the form of the system, a computer-readable medium or a method for generating the TTS voice. An embodiment of the invention relates to a method of enabling human workers to find errors when developing a text-to-speech (TTS) voice. The method comprises presenting a graphical user interface wherein after a first pass of automatic speech recognition (ASR) of a speech corpus is complete, the interface presents to a worker a graphical representation of an alignment of the ASR results, associated words and phonemes and the audio, receiving a graphical input from the worker associated with a selection of a word or phoneme and presenting the audio associated with the selected word or phoneme.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.