Patent · US Active

System and method for detecting fabricated videos

US12322175B2 · kind B2 · utility

0Cited by

1References

20Claims

0Family size

Assignee

UNIVERSITY OF MARYLAND, COLLEGE PARK · US

Inventors

Trisha Mittal · College Park, US
Uttaran Bhattacharya · Sunnyvale, US
Rohan Chandra · College Park, US
Aniket Bera · Greenbelt, US
Dinesh Manocha · Chapel Hill, US

Key dates

Filing date	Nov 1, 2021
Grant date	Jun 3, 2025
Priority date	—
Expiry date	Nov 28, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/63
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A video is classified as real or fake by extracting facial features, including facial modalities and facial emotions, and speech features, including speech modalities and speech emotions, from the video. The facial and speech modalities are passed through first and second neural networks, respectively, to generate facial and speech modality embeddings. The facial and speech emotions are passed through third and fourth neural networks, respectively, to generate facial and speech emotion embeddings. A first distance, d1, between the facial modality embedding and the speech modality embedding is generated, together with a second distance, d2, between the facial emotion embedding and the speech emotion embedding. The video is classified as fake if a sum of the first distance and the second distance exceeds a threshold distance. The networks may be trained using real and fake video pairs for multiple subjects.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.