Patent · US Active

Audio-driven three-dimensional facial animation model generation method and apparatus, and electronic device

US12254552B1 · kind B1 · utility

0Cited by

0References

8Claims

0Family size

Assignee

NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD. · CN

Inventors

Huapeng Sima · 安丰镇, CN
Zheng Liao · 安丰镇, CN

Key dates

Filing date	Dec 23, 2024
Grant date	Mar 18, 2025
Priority date	—
Expiry date	Dec 23, 2044

Classification

Technology area (CPC G)Physics
CPC primaryG06T17/00
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

This application provides a audio-driven three-dimensional facial animation model generation method and apparatus, and an electronic device. The method includes: acquiring sample data including sample audio data, sample speaking style data, and a sample blend shape value; performing feature extraction on the sample audio data to obtain a sample audio feature; performing convolution on the sample audio feature based on a to-be-trained audio-driven three-dimensional facial animation model to obtain an initial audio feature, and performing encoding on the sample speaking style data based on the to-be-trained audio-driven three-dimensional facial animation model to obtain a sample speaking style feature; performing encoding on the initial audio feature and the sample speaking style feature based on the to-be-trained audio-driven three-dimensional facial animation model, to obtain an output blend shape value; and performing calculation on the sample blend shape value and the output blend shape value to obtain a loss function value.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.