Patent · US Active

Audio-driven three-dimensional facial animation model generation method and apparatus, and electronic device

US12254552B1 · kind B1 · utility

Cited by: 0
References: 0
Claims: 8
Family size: 0

Assignee

Inventors

Key dates

Filing date: Dec 23, 2024
Grant date: Mar 18, 2025
Priority date
Expiry date: Dec 23, 2044

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06T17/00
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

This application provides an audio-driven three-dimensional facial animation model generation method and apparatus, and an electronic device. The method includes: acquiring sample data including sample audio data, sample speaking style data, and a sample blend shape value; performing feature extraction on the sample audio data to obtain a sample audio feature; performing convolution on the sample audio feature based on a to-be-trained audio-driven three-dimensional facial animation model to obtain an initial audio feature, and performing encoding on the sample speaking style data based on the to-be-trained audio-driven three-dimensional facial animation model to obtain a sample speaking style feature; performing encoding on the initial audio feature and the sample speaking style feature based on the to-be-trained audio-driven three-dimensional facial animation model, to obtain an output blend shape value; and performing calculation on the sample blend shape value and the output blend shape value to obtain a loss function value.
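The training steps named in the abstract can be illustrated with a toy forward pass. The following is a minimal sketch only and is not taken from the patent: the log-energy feature, the smoothing kernel, the embedding-table style encoder, the single linear layer with a sigmoid, and the mean squared error loss are all assumptions chosen for brevity, and every function name and constant is hypothetical.

```python
import math
import random

def extract_audio_features(samples, frame_size):
    """Split raw audio into non-overlapping frames; one log-energy value per frame (assumed feature)."""
    starts = range(0, len(samples) - frame_size + 1, frame_size)
    frames = [samples[i:i + frame_size] for i in starts]
    return [math.log(1e-8 + sum(x * x for x in f) / frame_size) for f in frames]

def conv1d_same(features, kernel):
    """1-D convolution over time with zero padding; output length equals input length (odd kernel)."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(features) + [0.0] * pad
    return [sum(k * padded[t + j] for j, k in enumerate(kernel))
            for t in range(len(features))]

def encode_style(style_id, style_table):
    """Look up an embedding vector for a speaking style id (assumed encoder)."""
    return style_table[style_id]

def encode_to_blend_shapes(audio_feat, style_feat, weights, bias):
    """Per frame: linear map of [audio value, style vector] followed by a sigmoid."""
    output = []
    for a in audio_feat:
        x = [a] + list(style_feat)
        frame = []
        for w_row, b in zip(weights, bias):
            z = sum(w * xi for w, xi in zip(w_row, x)) + b
            frame.append(1.0 / (1.0 + math.exp(-z)))
        output.append(frame)
    return output

def mse_loss(target, output):
    """Mean squared error between sample and output blend shape values (assumed loss)."""
    count = sum(len(frame) for frame in target)
    total = sum((t - o) ** 2
                for tf, of in zip(target, output)
                for t, o in zip(tf, of))
    return total / count

# Hypothetical sizes and synthetic sample data, for illustration only.
rng = random.Random(0)
NUM_STYLES, STYLE_DIM, NUM_BLEND_SHAPES, FRAME_SIZE, NUM_FRAMES = 2, 3, 4, 160, 5

sample_audio = [math.sin(0.05 * i) for i in range(FRAME_SIZE * NUM_FRAMES)]
sample_style_id = 1
sample_blend_shapes = [[rng.random() for _ in range(NUM_BLEND_SHAPES)]
                       for _ in range(NUM_FRAMES)]

# Randomly initialised parameters of the to-be-trained model.
style_table = [[rng.uniform(-0.1, 0.1) for _ in range(STYLE_DIM)]
               for _ in range(NUM_STYLES)]
weights = [[rng.uniform(-0.1, 0.1) for _ in range(1 + STYLE_DIM)]
           for _ in range(NUM_BLEND_SHAPES)]
bias = [0.0] * NUM_BLEND_SHAPES
kernel = [0.25, 0.5, 0.25]

# Forward pass in the order given in the abstract.
sample_audio_feature = extract_audio_features(sample_audio, FRAME_SIZE)
initial_audio_feature = conv1d_same(sample_audio_feature, kernel)
sample_style_feature = encode_style(sample_style_id, style_table)
output_blend_shapes = encode_to_blend_shapes(
    initial_audio_feature, sample_style_feature, weights, bias)
loss = mse_loss(sample_blend_shapes, output_blend_shapes)
print("loss function value:", loss)
```

In an actual training loop the loss function value would presumably be used to update the model parameters; that step is outside what the abstract states and is omitted here.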

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.