Patent · US Active

Generating facial position data based on audio data

US11049308B2 · kind B2 · utility

1Cited by

1References

18Claims

0Family size

Assignee

Electronic Arts Inc. · US

Inventors

Jorge del Val Santos · Stockholm, SE
Linus Gisslén · Stockholm, SE
Martin Singh-Blom · Stockholm, SE
Kristoffer Sjöö · Stockholm, SE
Mattias Teye · Sundbyberg, SE

Key dates

Filing date	Apr 25, 2019
Grant date	Jun 29, 2021
Priority date	—
Expiry date	Apr 25, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG06T13/40
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A computer-implemented method for generating a machine-learned model to generate facial position data based on audio data comprising training a conditional variational autoencoder having an encoder and decoder. The training comprises receiving a set of training data items, each training data item comprising a facial position descriptor and an audio descriptor; processing one or more of the training data items using the encoder to obtain distribution parameters; sampling a latent vector from a latent space distribution based on the distribution parameters; processing the latent vector and the audio descriptor using the decoder to obtain a facial position output; calculating a loss value based at least in part on a comparison of the facial position output and the facial position descriptor of at least one of the one or more training data items; and updating parameters of the conditional variational autoencoder based at least in part on the calculated loss value.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.