Systems and methods for generating a video summary of a virtual event
US11889168B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 11, 2022 |
| Grant date | Jan 30, 2024 |
| Priority date | — |
| Expiry date | Jul 13, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/025
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
A video summary device may generate a textual summary of a transcription of a virtual event. The video summary device may generate a phonemic transcription of the textual summary and generate a text embedding based on the phonemic transcription. The video summary device may generate an audio embedding based on a target voice. The video summary device may generate an audio output of the phonemic transcription uttered by the target voice. The audio output may be generated based on the text embedding and the audio embedding. The video summary device may generate an image embedding based on video data of a target user. The image embedding may include information regarding images of facial movements of the target user. The video summary device may generate a video output of different facial movements of the target user uttering the phonemic transcription, based on the text embedding and the image embedding.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.