Patent · US Active

System and method for detecting fabricated videos

US12322175B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 1, 2021
Grant dateJun 3, 2025
Priority date
Expiry dateNov 28, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/63
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A video is classified as real or fake by extracting facial features, including facial modalities and facial emotions, and speech features, including speech modalities and speech emotions, from the video. The facial and speech modalities are passed through first and second neural networks, respectively, to generate facial and speech modality embeddings. The facial and speech emotions are passed through third and fourth neural networks, respectively, to generate facial and speech emotion embeddings. A first distance, d1, between the facial modality embedding and the speech modality embedding is generated, together with a second distance, d2, between the facial emotion embedding and the speech emotion embedding. The video is classified as fake if a sum of the first distance and the second distance exceeds a threshold distance. The networks may be trained using real and fake video pairs for multiple subjects.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.