Patent · US Active

Method for generating a dynamic image based on audio, device, and storage medium

US12260481B1 · kind B1 · utility

0Cited by
3References
9Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 19, 2024
Grant dateMar 25, 2025
Priority date
Expiry dateJul 19, 2044

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY02T10/40
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed are a method for generating a dynamic image based on audio, a device, and a storage medium, relating to the field of natural human-computer interactions. The method includes: obtaining a reference image and reference audio input by a user; determining a target head pose feature and a target expression coefficient feature based on the reference image and a trained generation network model, and adjusting the trained generation network model based on the target head pose feature and the target expression coefficient feature, to obtain a target generation network model; and processing a to-be-processed image based on the reference audio, the reference image, and the target generation network model, to obtain a target dynamic image. An image object in the to-be-processed image is same as that in the reference image. In this case, a corresponding digital person can be obtained based on a single picture of a target person.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.