Patent · US Active

Audio and video translator

US11551664B2 · kind B2 · utility

Cited by: 0
References: 1
Claims: 19
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 5, 2022
Grant date: Jan 10, 2023
Priority date
Expiry date: May 5, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/26
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A system and method for translating audio and, when desired, video. The translations include synthetic media and data generated using AI systems. Through unique processors and generators executing a unique sequence of steps, the system and method produce more accurate translations that can account for various speech characteristics (e.g., emotion, pacing, idioms, sarcasm, jokes, tone, phonemes, etc.). These speech characteristics are identified in the input media and synthetically incorporated into the translated outputs to mirror the characteristics in the input media. Some embodiments further include systems and methods that manipulate the input video such that the speakers' faces and/or lips appear as if they are natively speaking the generated audio.
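
The data flow the abstract describes (identify speech characteristics in the input, then carry them into the translated output) can be pictured with a minimal sketch. Everything here is a hypothetical illustration and not the patent's claimed method: the `Segment` type, the `translate_text` and `translate_segment` helpers, and the two-word `TOY_LEXICON` standing in for a real translation model are all assumptions introduced only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Segment:
    """One utterance plus the speech characteristics detected for it (hypothetical type)."""
    text: str
    start_s: float
    end_s: float
    characteristics: dict = field(default_factory=dict)


# Hypothetical stand-in for a real translation model: a two-word lookup table.
TOY_LEXICON = {"hello": "hola", "friend": "amigo"}


def translate_text(text: str) -> str:
    """Translate word by word, leaving unknown words unchanged."""
    return " ".join(TOY_LEXICON.get(word.lower(), word) for word in text.split())


def translate_segment(seg: Segment) -> Segment:
    """Translate the text while copying timing and characteristics unchanged,
    so a downstream synthesizer could mirror them in the generated audio."""
    return Segment(
        text=translate_text(seg.text),
        start_s=seg.start_s,
        end_s=seg.end_s,
        characteristics=dict(seg.characteristics),
    )


source = Segment("Hello friend", 0.0, 1.2, {"emotion": "joy", "pacing": "fast"})
target = translate_segment(source)
print(target)
```

The only design point the sketch makes is that the characteristics travel alongside the text as separate metadata rather than being inferred again from the translated words. Speech synthesis and the lip-sync step mentioned in the abstract are outside its scope.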

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.