Patent · US Active

Method and system for text-to-speech synthesis with personalized voice

US9368102B2 · kind B2 · utility

Cited by	1
References	20
Claims	20
Family size	0

Assignee

Nuance Communications, Inc. · US

Inventors

Itzhack Goldberg · Hadera, IL
Ron Hoory · Haifa, IL
Boaz Mizrachi · Haifa, IL
Zvi Kons · Nesher, IL

Key dates

Filing date	Oct 10, 2014
Grant date	Jun 14, 2016
Priority date	—
Expiry date	Dec 25, 2034

Classification

Technology area (CPC G)	Physics
CPC primary	G10L13/04
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method and system are provided for text-to-speech synthesis with personalized voice. The method includes receiving an incidental audio input (403) of speech in the form of an audio communication from an input speaker (401) and generating a voice dataset (404) for the input speaker (401). The method includes receiving a text input (411) at the same device as the audio input (403) and synthesizing (312) the text from the text input (411) to synthesized speech including using the voice dataset (404) to personalize the synthesized speech to sound like the input speaker (401). In addition, the method includes analyzing (316) the text for expression and adding the expression (315) to the synthesized speech. The audio communication may be part of a video communication (453) and the audio input (403) may have an associated visual input (455) of an image of the input speaker. The synthesis from text may include providing a synthesized image personalized to look like the image of the input speaker with expressions added from the visual input (455).
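The data flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: every name in it (VoiceDataset, analyze_expression, synthesize, SynthesizedSpeech) is hypothetical, the expression analysis is a toy rule set assumed for illustration, and no real audio is processed or produced. It only shows the order of the steps: audio received incidentally feeds a per-speaker voice dataset, and later text input is synthesized with that dataset and tagged with an expression derived from the text.

```python
from dataclasses import dataclass, field


@dataclass
class VoiceDataset:
    """Hypothetical per-speaker store of voice samples (illustration only)."""
    speaker_id: str
    samples: list = field(default_factory=list)

    def add_audio(self, audio: bytes) -> None:
        # A real system would extract voice features; this toy keeps the raw bytes.
        self.samples.append(audio)


@dataclass
class SynthesizedSpeech:
    """Stand-in for synthesized output: a description, not actual audio."""
    text: str
    voice_of: str
    expression: str


def analyze_expression(text: str) -> str:
    """Toy text analysis for expression, based on simple surface cues (assumed)."""
    stripped = text.rstrip()
    if ":-)" in text or ":)" in text:
        return "happy"
    if stripped.endswith("!"):
        return "excited"
    if stripped.endswith("?"):
        return "questioning"
    return "neutral"


def synthesize(text: str, voice: VoiceDataset) -> SynthesizedSpeech:
    """Placeholder synthesis: pairs the text with the speaker's dataset and an expression."""
    if not voice.samples:
        raise ValueError("no audio has been received for this speaker yet")
    return SynthesizedSpeech(
        text=text,
        voice_of=voice.speaker_id,
        expression=analyze_expression(text),
    )


# Audio arrives incidentally (for example during a call), then text arrives later.
voice = VoiceDataset("input_speaker")
voice.add_audio(b"\x00\x01")  # placeholder bytes standing in for captured speech
result = synthesize("See you tomorrow!", voice)
```

Here result records which speaker's dataset was used and which expression the toy analysis assigned to the text. A real system would replace both placeholders with trained voice and expression models.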

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.