Patent · US Active

Automatic synthesis of translated speech using speaker-specific phonemes

US11594226B2 · kind B2 · utility

0Cited by

4References

20Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Su Liu · Austin, US
Yang Liang · Beijing, CN
Debbie Anglin · Georgetown, US
Fan Yang · Redwood City, US

Key dates

Filing date	Dec 22, 2020
Grant date	Feb 28, 2023
Priority date	—
Expiry date	Dec 24, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

An embodiment includes converting an original audio signal to an original text string, the original audio signal being from a recording of the original text string spoken by a specific person in a source language. The embodiment generates a translated text string by translating the original text string from the source language to a target language, including translation of a word from the source language to a target language. The embodiment assembles a standard phoneme sequence from a set of standard phonemes, where the standard phoneme sequence includes a standard pronunciation of the translated word. The embodiment also associates a custom phoneme with a standard phoneme of the standard phoneme sequence, where the custom phoneme includes the specific person's pronunciation of a sound in the translated word. The embodiment synthesizes the translated text string to a translated audio signal including the translated word pronounced using the custom phoneme.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.