Patent · US Active

Methods and systems for image and voice processing

US10949715B1 · kind B1 · utility

4Cited by
23References
30Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 27, 2020
Grant dateMar 16, 2021
Priority date
Expiry dateMay 27, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/105
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.