Patent · US Active

Photo-realistic synthesis of three dimensional animation with facial features synchronized with speech

US9613450B2 · kind B2 · utility

15Cited by

9References

20Claims

0Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Lijuan Wang · Guiyang, CN
Frank Kao-Ping Soong · Beijing, CN
Qiang Huo · Beijing, CN
Zhengyou Zhang · Redmond, US

Key dates

Filing date	May 3, 2011
Grant date	Apr 4, 2017
Priority date	—
Expiry date	Jun 1, 2034

Classification

Technology area (CPC G)Physics
CPC primaryG10L2021/105
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Dynamic texture mapping is used to create a photorealistic three dimensional animation of an individual with facial features synchronized with desired speech. Audiovisual data of an individual reading a known script is obtained and stored in an audio library and an image library. The audiovisual data is processed to extract feature vectors used to train a statistical model. An input audio feature vector corresponding to desired speech with which the animation will be synchronized is provided. The statistical model is used to generate a trajectory of visual feature vectors that corresponds to the input audio feature vector. These visual feature vectors are used to identify a matching image sequence from the image library. The resulting sequence of images, concatenated from the image library, provides a photorealistic image sequence with facial features, such as lip movements, synchronized with the desired speech. This image sequence is applied to the three-dimensional model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.