Patent · US Active

System and method for generating customized text-to-speech voices

US8666746B2 · kind B2 · utility

10Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 13, 2004
Grant dateMar 4, 2014
Priority date
Expiry dateJul 3, 2028

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L13/08
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.