Patent · US Active

Text and audio-based real-time face reenactment

US11114086B2 · kind B2 · utility

0Cited by

0References

20Claims

0Family size

Assignee

SNAP INC. · US

Inventors

Pavel Savchenkov · London, GB
Maxim Lukin · Sochi, RU
Aleksandr Mashrabov · Los Angeles, US

Key dates

Filing date	Jul 11, 2019
Grant date	Sep 7, 2021
Priority date	—
Expiry date	Aug 22, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/08
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Provided are systems and methods for text and audio-based real-time face reenactment. An example method includes receiving an input text and a target image, the target image including a target face; generating, based on the input text, a sequence of sets of acoustic features representing the input text; determining, based on the sequence of sets of acoustic features, a sequence of sets of scenario data indicating modifications of the target face for pronouncing the input text; generating, based on the sequence of sets of scenario data, a sequence of frames, wherein each of the frames includes the target face modified based on at least one of the sets of scenario data; generating, based on the sequence of frames, an output video; and synthesizing, based on the sequence of sets of acoustic features, an audio data and adding the audio data to the output video.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.