Audio-driven three-dimensional facial animation model generation method and apparatus, and electronic device
US12254552B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 23, 2024 |
| Grant date | Mar 18, 2025 |
| Priority date | — |
| Expiry date | Dec 23, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T17/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
This application provides a audio-driven three-dimensional facial animation model generation method and apparatus, and an electronic device. The method includes: acquiring sample data including sample audio data, sample speaking style data, and a sample blend shape value; performing feature extraction on the sample audio data to obtain a sample audio feature; performing convolution on the sample audio feature based on a to-be-trained audio-driven three-dimensional facial animation model to obtain an initial audio feature, and performing encoding on the sample speaking style data based on the to-be-trained audio-driven three-dimensional facial animation model to obtain a sample speaking style feature; performing encoding on the initial audio feature and the sample speaking style feature based on the to-be-trained audio-driven three-dimensional facial animation model, to obtain an output blend shape value; and performing calculation on the sample blend shape value and the output blend shape value to obtain a loss function value.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.