Patent · US Active

Method for generating a dynamic image based on audio, device, and storage medium

US12260481B1 · kind B1 · utility

Cited by	0
References	3
Claims	9
Family size	0

Assignee

NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD. · CN

Inventors

Huapeng Sima · Anfeng Town, CN
Maolin Zhang · Hangzhou City, CN
Liyan Mao · Nanjing, CN

Key dates

Filing date	Jul 19, 2024
Grant date	Mar 25, 2025
Priority date	—
Expiry date	Jul 19, 2044

Classification

Technology area (CPC Y)	Emerging Cross-Sectional Technologies
CPC primary	Y02T10/40
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Disclosed are a method for generating a dynamic image based on audio, a device, and a storage medium, relating to the field of natural human-computer interaction. The method includes: obtaining a reference image and reference audio input by a user; determining a target head pose feature and a target expression coefficient feature based on the reference image and a trained generation network model, and adjusting the trained generation network model based on the target head pose feature and the target expression coefficient feature, to obtain a target generation network model; and processing a to-be-processed image based on the reference audio, the reference image, and the target generation network model, to obtain a target dynamic image. An image object in the to-be-processed image is the same as that in the reference image. In this way, a corresponding digital person can be obtained from a single picture of a target person.
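The data flow described in the abstract can be sketched roughly as follows. This is only an illustrative toy, not the patented implementation: images and audio are plain lists of floats, the "features" are simple statistics rather than network outputs, and every name (`GenerationModel`, `extract_target_features`, `adjust_model`, `generate_dynamic_image`, `samples_per_frame`) is a hypothetical placeholder that does not appear in the patent record.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

Image = List[float]  # toy stand-in: a flat list of pixel intensities
Audio = List[float]  # toy stand-in: a flat list of audio samples


@dataclass(frozen=True)
class GenerationModel:
    """Hypothetical placeholder for the trained generation network model."""
    head_pose: float = 0.0
    expression: float = 0.0


def extract_target_features(reference_image: Image,
                            model: GenerationModel) -> Tuple[float, float]:
    """Toy feature extraction; a real system would run the network here."""
    head_pose = sum(reference_image) / len(reference_image)
    expression = max(reference_image) - min(reference_image)
    return head_pose, expression


def adjust_model(model: GenerationModel, head_pose: float,
                 expression: float) -> GenerationModel:
    """Return a copy of the model specialised to the target features."""
    return replace(model, head_pose=head_pose, expression=expression)


def generate_dynamic_image(image: Image, audio: Audio,
                           reference_image: Image,
                           model: GenerationModel,
                           samples_per_frame: int = 2) -> List[Image]:
    """Produce one toy frame per audio chunk, driven by the adjusted model."""
    # Assumed sanity check: both images must depict the same object,
    # approximated here by requiring equal size.
    if len(image) != len(reference_image):
        raise ValueError("to-be-processed image must match the reference image")
    frames = []
    for start in range(0, len(audio), samples_per_frame):
        chunk = audio[start:start + samples_per_frame]
        drive = sum(chunk) / len(chunk)  # mean amplitude of this chunk
        frames.append([p + drive * model.expression for p in image])
    return frames
```

The sketch keeps the three stages separate (feature determination, model adjustment, audio-driven generation) so that each can be swapped for a real component without changing the overall flow.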

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.