Duration informed attention network (DURIAN) for audio-visual synthesis
US11670283B2 · kind B2 · utility
0Cited by
12References
18Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Aug 6, 2021 |
| Grant date | Jun 6, 2023 |
| Priority date | — |
| Expiry date | Aug 6, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2013/105
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and apparatus include receiving a text input that includes a sequence of text components. Respective temporal durations of the text components are determined using a duration model. A spectrogram frame is generated based on the duration model. An audio waveform is generated based on the spectrogram frame. Video information is generated based on the audio waveform. The audio waveform is provided as an output along with a corresponding video.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.