Patent · US Active

Methods and systems for image and voice processing

US11670024B2 · kind B2 · utility

0Cited by
29References
23Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 8, 2021
Grant dateJun 6, 2023
Priority date
Expiry dateMar 8, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/105
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.