Automatic synthesis of translated speech using speaker-specific phonemes
US11594226B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 22, 2020 |
| Grant date | Feb 28, 2023 |
| Priority date | — |
| Expiry date | Dec 24, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/025
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An embodiment includes converting an original audio signal to an original text string, the original audio signal being from a recording of the original text string spoken by a specific person in a source language. The embodiment generates a translated text string by translating the original text string from the source language to a target language, including translation of a word from the source language to a target language. The embodiment assembles a standard phoneme sequence from a set of standard phonemes, where the standard phoneme sequence includes a standard pronunciation of the translated word. The embodiment also associates a custom phoneme with a standard phoneme of the standard phoneme sequence, where the custom phoneme includes the specific person's pronunciation of a sound in the translated word. The embodiment synthesizes the translated text string to a translated audio signal including the translated word pronounced using the custom phoneme.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.