Systems and methods for controllable video generation
US12413829B2 · kind B2 · utility
0Cited by
1References
20Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Jan 31, 2024 |
| Grant date | Sep 9, 2025 |
| Priority date | — |
| Expiry date | Jan 31, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T2207/20182
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
Embodiments described herein provide a video generation framework built on a decoupled multimodal cross-attention module to simultaneously condition the generation on both an input image and a text input. The video generation may thus be conditioned on the visual appearance of a target object reflected in the input image. In this way, zero-shot video generation may be achieved with little fine-tuning efforts.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.