Patent · US Active

Systems and methods for generating a video summary of a virtual event

US12200322B2 · kind B2 · utility

0Cited by
8References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 19, 2023
Grant dateJan 14, 2025
Priority date
Expiry dateDec 19, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/025
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

A video summary device may generate a textual summary of a transcription of a virtual event. The video summary device may generate a phonemic transcription of the textual summary and generate a text embedding based on the phonemic transcription. The video summary device may generate an audio embedding based on a target voice. The video summary device may generate an audio output of the phonemic transcription uttered by the target voice. The audio output may be generated based on the text embedding and the audio embedding. The video summary device may generate an image embedding based on video data of a target user. The image embedding may include information regarding images of facial movements of the target user. The video summary device may generate a video output of different facial movements of the target user uttering the phonemic transcription, based on the text embedding and the image embedding.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.