Generation of video stream having localized lip-syncing with personalized characteristics
US12278999B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 21, 2023 |
| Grant date | Apr 15, 2025 |
| Priority date | — |
| Expiry date | Jun 21, 2043 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04N21/43072
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
A computer-implemented method, in accordance with one embodiment, includes detecting cultural context and accents of speakers portrayed in a video stream and/or an audience of the video stream. Accent tags are selected for the speakers according to the cultural context and accents of the speakers and/or the audience of the video stream. A textual representation of spoken words of the speakers is translated from a source language to a target language. The accent tags are applied to the textual representation of the spoken words in the target language according to the speakers corresponding to the textual representation of the spoken words in the target language. Speech lip movements of the speakers portrayed in the video stream are modified to match the target language and the locale accent tags. A translated video stream having the speakers appearing to speak in the target language with the modified lip movements is output.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.