Audio and video translator
US11908449B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 29, 2022 |
| Grant date | Feb 20, 2024 |
| Priority date | — |
| Expiry date | Nov 29, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/26
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system and method for translating audio, and video when desired. The translations include synthetic media and data generated using AI systems. Through unique processors and generators executing a unique sequence of steps, the system and method produce more accurate translations that can account for various speech characteristics (e.g., emotion, pacing, idioms, sarcasm, jokes, tone, phonemes, etc.). These speech characteristics are identified in the input media and synthetically incorporated into the translated outputs to mirror the characteristics in the input media. Some embodiments further include systems and methods that manipulate the input video such that the speakers' faces and/or lips appear as if they are natively speaking the generated audio.
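Illustrative sketch (not part of the patent record): the abstract's central idea is that speech characteristics identified in the input are carried into the translated output. The Python below is a minimal toy of that data flow only. Every name in it (`Segment`, `translate_segment`, `toy_translate`, the `emotion` and `pace_wps` fields, and the two-word lexicon) is a hypothetical chosen for illustration, and a lookup table stands in for the AI translation and synthesis components, which the abstract does not specify.

```python
from dataclasses import dataclass, replace
from typing import Callable


@dataclass(frozen=True)
class Segment:
    """One utterance plus characteristics detected in the input (hypothetical fields)."""
    text: str
    emotion: str      # e.g. "happy", "sarcastic"
    pace_wps: float   # speaking pace in words per second


def translate_segment(seg: Segment, translate_text: Callable[[str], str]) -> Segment:
    """Translate only the text; copy the detected characteristics unchanged
    so the output segment mirrors the input's emotion and pacing."""
    return replace(seg, text=translate_text(seg.text))


# Toy stand-in for a real translation model (assumption, not the patent's method).
TOY_LEXICON = {"hello": "hola", "friend": "amigo"}


def toy_translate(text: str) -> str:
    """Word-by-word lookup; unknown words pass through unchanged."""
    return " ".join(TOY_LEXICON.get(word, word) for word in text.lower().split())


source = Segment(text="Hello friend", emotion="happy", pace_wps=2.5)
target = translate_segment(source, toy_translate)
print(target)
```

In this sketch a downstream speech or video generator would read `emotion` and `pace_wps` from the translated segment; those stages are omitted because the abstract gives no detail on them.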
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.