Patent · US Active

Methods, apparatus and data structure for cross-language speech adaptation

US9798653B1 · kind B1 · utility

7Cited by

68References

7Claims

0Family size

Assignee

Nuance Communications, Inc. · US

Inventors

Xu Shao · Irvine, US
Andrew Paul Breen · Norwich, GB

Key dates

Filing date	May 5, 2010
Grant date	Oct 24, 2017
Priority date	—
Expiry date	Dec 15, 2030

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/086
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Adapted speech models produce fluent synthesized speech in a voice that sounds as if the speaker were fluent in a language in which the speaker is actually non-fluent. A full speech model is obtained based on fluent speech in the language spoken by a first person who is fluent in the language. A limited set of utterances is obtained in the language spoken by a second person who is non-fluent in the language but able to speak the limited set of utterances in the language. The full speech model of the first person is then processed with the limited set of utterances of the second person to produce an adapted speech model. The adapted speech model may be stored to a multi-lingual speech model as a child node of a root with an associated language selection question and branches pointed to the adapted speech model and other speech models, respectively.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.