System and method for detecting fabricated videos
US12322175B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 1, 2021 |
| Grant date | Jun 3, 2025 |
| Priority date | — |
| Expiry date | Nov 28, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/63
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A video is classified as real or fake by extracting facial features, including facial modalities and facial emotions, and speech features, including speech modalities and speech emotions, from the video. The facial and speech modalities are passed through first and second neural networks, respectively, to generate facial and speech modality embeddings. The facial and speech emotions are passed through third and fourth neural networks, respectively, to generate facial and speech emotion embeddings. A first distance, d1, between the facial modality embedding and the speech modality embedding is generated, together with a second distance, d2, between the facial emotion embedding and the speech emotion embedding. The video is classified as fake if a sum of the first distance and the second distance exceeds a threshold distance. The networks may be trained using real and fake video pairs for multiple subjects.
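The decision rule in the abstract (classify as fake when d1 + d2 exceeds a threshold) can be illustrated with a minimal sketch. This is not the patented implementation: the abstract does not name a distance metric, so Euclidean distance is assumed here, and the function names `euclidean` and `classify_video`, the plain-list embeddings, and the threshold value are all hypothetical stand-ins for the outputs of the four neural networks.

```python
import math
from typing import Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two embeddings (assumed metric)."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify_video(
    face_modality: Sequence[float],
    speech_modality: Sequence[float],
    face_emotion: Sequence[float],
    speech_emotion: Sequence[float],
    threshold: float,
) -> str:
    """Return "fake" if d1 + d2 exceeds the threshold, else "real".

    The four arguments stand in for the embeddings produced by the
    first through fourth neural networks described in the abstract.
    """
    # d1: distance between facial and speech modality embeddings
    d1 = euclidean(face_modality, speech_modality)
    # d2: distance between facial and speech emotion embeddings
    d2 = euclidean(face_emotion, speech_emotion)
    return "fake" if d1 + d2 > threshold else "real"


if __name__ == "__main__":
    # Hypothetical 2-D embeddings; real embeddings would be much larger.
    print(classify_video([0.0, 0.0], [3.0, 4.0], [1.0, 1.0], [1.0, 1.0], 4.0))
```

In this sketch a video whose face and speech embeddings agree closely yields small distances and is labeled real, while a mismatch in either the modality pair or the emotion pair pushes the sum toward the threshold.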
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.