Patent · US Active

System and method for correcting errors when generating a TTS voice

US7742921B1 · kind B1 · utility

16Cited by
15References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 27, 2005
Grant dateJun 22, 2010
Priority date
Expiry dateFeb 5, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L13/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed herein are various innovations associated with a toolkit used for generating a TTS voice for use in a spoken dialog system. The inventions in each case may be in the form of the system, a computer-readable medium or a method for generating the TTS voice. An embodiment of the invention relates to a method of enabling human workers to find errors when developing a text-to-speech (TTS) voice. The method comprises presenting a graphical user interface wherein after a first pass of automatic speech recognition (ASR) of a speech corpus is complete, the interface presents to a worker a graphical representation of an alignment of the ASR results, associated words and phonemes and the audio, receiving a graphical input from the worker associated with a selection of a word or phoneme and presenting the audio associated with the selected word or phoneme.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.