Patent · US Active

Text and audio-based real-time face reenactment

US11114086B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 11, 2019
Grant date: Sep 7, 2021
Priority date
Expiry date: Aug 22, 2039

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L13/08
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Provided are systems and methods for text and audio-based real-time face reenactment. An example method includes receiving an input text and a target image, the target image including a target face; generating, based on the input text, a sequence of sets of acoustic features representing the input text; determining, based on the sequence of sets of acoustic features, a sequence of sets of scenario data indicating modifications of the target face for pronouncing the input text; generating, based on the sequence of sets of scenario data, a sequence of frames, wherein each of the frames includes the target face modified based on at least one of the sets of scenario data; generating, based on the sequence of frames, an output video; and synthesizing, based on the sequence of sets of acoustic features, an audio data and adding the audio data to the output video.
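
The data flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: every function name, the single `mouth_open` parameter, the vowel-based feature rule, and `SAMPLES_PER_FRAME` are hypothetical stand-ins chosen only to show how the stages connect (text to acoustic features, features to scenario data, scenario data to frames, and features to audio added to the output video).

```python
from dataclasses import dataclass
from typing import Dict, List

SAMPLES_PER_FRAME = 4  # hypothetical: audio samples produced per feature set


@dataclass
class Frame:
    target_image: str  # identifier of the image containing the target face
    mouth_open: float  # hypothetical face modification, 0.0 (closed) to 1.0 (open)


@dataclass
class OutputVideo:
    frames: List[Frame]
    audio: List[float]


def text_to_acoustic_features(text: str) -> List[List[float]]:
    """Stand-in for acoustic feature generation: one feature set per letter.

    Assumption for illustration: vowels get a high energy value, other letters a low one.
    """
    return [[1.0 if ch in "aeiou" else 0.3] for ch in text.lower() if ch.isalpha()]


def features_to_scenario(features: List[List[float]]) -> List[Dict[str, float]]:
    """Stand-in: derive one set of scenario data (face parameters) per feature set."""
    return [{"mouth_open": min(1.0, fs[0])} for fs in features]


def render_frames(target_image: str, scenario: List[Dict[str, float]]) -> List[Frame]:
    """Stand-in renderer: each frame records the target face and its modification."""
    return [Frame(target_image, s["mouth_open"]) for s in scenario]


def synthesize_audio(features: List[List[float]]) -> List[float]:
    """Stand-in vocoder: emit SAMPLES_PER_FRAME constant samples per feature set."""
    return [fs[0] for fs in features for _ in range(SAMPLES_PER_FRAME)]


def reenact(text: str, target_image: str) -> OutputVideo:
    """Chain the stages in the order the abstract lists them."""
    features = text_to_acoustic_features(text)
    scenario = features_to_scenario(features)
    frames = render_frames(target_image, scenario)
    audio = synthesize_audio(features)  # audio comes from the same acoustic features
    return OutputVideo(frames=frames, audio=audio)


video = reenact("hi", "face.png")
print(len(video.frames), len(video.audio))  # prints: 2 8
```

Because the frames and the audio are both derived from the same sequence of acoustic features, the sketch keeps them aligned by construction, which is the point of routing both branches through that sequence.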

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.