Method for generating a dynamic image based on audio, device, and storage medium
US12260481B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 19, 2024 |
| Grant date | Mar 25, 2025 |
| Priority date | — |
| Expiry date | Jul 19, 2044 |
Classification
- Technology area (CPC Y): Emerging Cross-Sectional Technologies
- CPC primary: Y02T10/40
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed are a method for generating a dynamic image based on audio, a device, and a storage medium, relating to the field of natural human-computer interaction. The method includes: obtaining a reference image and reference audio input by a user; determining a target head pose feature and a target expression coefficient feature based on the reference image and a trained generation network model, and adjusting the trained generation network model based on the target head pose feature and the target expression coefficient feature to obtain a target generation network model; and processing a to-be-processed image based on the reference audio, the reference image, and the target generation network model to obtain a target dynamic image. The image object in the to-be-processed image is the same as that in the reference image. In this way, a corresponding digital person can be obtained from a single picture of a target person.
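The data flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: every name (`GenerationModel`, `estimate_targets`, `adapt_model`, `generate_dynamic_image`, `run_pipeline`, `samples_per_frame`) is hypothetical, images and audio are represented as plain lists of floats, and the arithmetic is a toy stand-in for the neural networks the patent refers to. Only the order of the steps and their inputs and outputs follow the abstract.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

Image = List[float]  # hypothetical stand-in: flat list of pixel values
Audio = List[float]  # hypothetical stand-in: flat list of audio samples


@dataclass(frozen=True)
class GenerationModel:
    """Toy placeholder for the trained generation network model."""
    head_pose: Tuple[float, ...] = ()
    expression: Tuple[float, ...] = ()
    adapted: bool = False


def estimate_targets(reference_image: Image, model: GenerationModel):
    """Derive a target head pose feature and expression coefficient feature.

    Placeholder arithmetic only; a real system would run the model here.
    """
    mean = sum(reference_image) / len(reference_image)
    head_pose = (mean, mean / 2, 0.0)  # assumed layout, for illustration
    expression = (max(reference_image), min(reference_image))
    return head_pose, expression


def adapt_model(model: GenerationModel, head_pose, expression) -> GenerationModel:
    """Adjust the trained model with the two features to get the target model."""
    return replace(model, head_pose=head_pose, expression=expression, adapted=True)


def generate_dynamic_image(image: Image, reference_audio: Audio,
                           reference_image: Image, target_model: GenerationModel,
                           samples_per_frame: int = 4) -> List[Image]:
    """Produce one frame per audio chunk.

    The toy rule scales pixel values by the chunk's mean absolute amplitude.
    reference_image is accepted to mirror the abstract's inputs but is not
    used by this toy rule.
    """
    if not target_model.adapted:
        raise ValueError("expected the adapted target model")
    frames = []
    for start in range(0, len(reference_audio), samples_per_frame):
        chunk = reference_audio[start:start + samples_per_frame]
        gain = 1.0 + sum(abs(s) for s in chunk) / len(chunk)
        frames.append([p * gain for p in image])
    return frames


def run_pipeline(reference_image: Image, reference_audio: Audio,
                 image: Image, trained_model: GenerationModel) -> List[Image]:
    """Chain the steps: estimate features, adapt the model, generate frames."""
    head_pose, expression = estimate_targets(reference_image, trained_model)
    target_model = adapt_model(trained_model, head_pose, expression)
    return generate_dynamic_image(image, reference_audio, reference_image, target_model)
```

For example, calling `run_pipeline` with a three-pixel image and eight audio samples yields two frames, one per four-sample chunk. The frozen dataclass keeps the trained model unchanged, so the adapted target model is a separate object, which matches the abstract's distinction between the trained model and the target model.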
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.