Patent · US Active

Using machine-learning models to determine movements of a mouth corresponding to live speech

US11211060B2 · kind B2 · utility

Cited by	3

References	9

Claims	20

Family size	0

Assignee

Adobe Inc. · US

Inventors

Wilmot Wei-Mau Li · Seattle, US
Jovan Popovic · Seattle, US
Deepali Aneja · Seattle, US
David Simons · Seattle, US

Key dates

Filing date	May 29, 2020
Grant date	Dec 28, 2021
Priority date	—
Expiry date	May 29, 2040

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2021/105
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Disclosed systems and methods predict visemes from an audio sequence. In an example, a viseme-generation application accesses a first audio sequence that is mapped to a sequence of visemes. The first audio sequence has a first length and represents phonemes. The application adjusts a second length of a second audio sequence such that the second length equals the first length and represents the phonemes. The application adjusts the sequence of visemes to the second audio sequence such that phonemes in the second audio sequence correspond to the phonemes in the first audio sequence. The application trains a machine-learning model with the second audio sequence and the sequence of visemes. The machine-learning model predicts an additional sequence of visemes based on an additional sequence of audio.
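The length-matching step in the abstract can be illustrated with a small sketch. The abstract does not name the alignment technique, so dynamic time warping (DTW) is used here as an assumption. The function names `dtw_path` and `warp_to_reference`, the scalar per-frame features, and the viseme labels are all hypothetical, chosen only for illustration. The idea shown: a second recording of the same phonemes is warped to the length of the first, so the first recording's viseme labels can be reused as training targets.

```python
from math import inf


def dtw_path(ref, new):
    """Return a DTW alignment path as (ref_index, new_index) pairs.

    Assumption: each frame is a single float; real audio features
    would be vectors with a vector distance.
    """
    n, m = len(ref), len(new)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(ref[i - 1] - new[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])

    # Walk back from the end, always moving to the cheapest predecessor.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return path


def warp_to_reference(ref, new):
    """Resample `new` so it has exactly len(ref) frames.

    For each reference frame, keep the last frame of `new` aligned to it.
    """
    warped = [None] * len(ref)
    for i, j in dtw_path(ref, new):
        warped[i] = new[j]
    return warped


# Hypothetical toy data: one feature value per frame.
ref_audio = [0.0, 0.0, 1.0, 1.0, 0.0]           # first sequence, already labeled
ref_visemes = ["sil", "sil", "Aa", "Aa", "sil"]  # one viseme per reference frame
new_audio = [0.0, 1.0, 1.0, 1.0, 0.0, 0.0]       # second sequence, different length

warped = warp_to_reference(ref_audio, new_audio)
# The warped frames now line up one-to-one with the reference visemes,
# giving (audio frame, viseme) pairs that could feed a model's training set.
training_pairs = list(zip(warped, ref_visemes))
```

In this sketch the warped sequence always has the same number of frames as the reference, which is what lets the existing viseme labels be paired with it frame by frame. Training the model itself is outside the scope of the example.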

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.