Low-latency captioning system
US11445267B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 23, 2021 |
| Grant date | Sep 13, 2022 |
| Priority date | — |
| Expiry date | Jul 23, 2041 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04N21/44008
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
A scene captioning system is provided. The scene captioning system includes an interface configured to acquire a stream of scene data signals including frames and sound data, a memory to store a computer-executable scene captioning model including a scene encoder, a timing decoder, a timing detector, and a caption decoder, wherein the audio-visual encoder is shared by the timing decoder and the timing detector and the caption decoder, and a processor, in connection with the memory. The processor is configured to perform steps of extracting scene features from the scene data signals by use of the audio-visual encoder, determining a timing of generating a caption by use of the timing detector, wherein the timing is arranged an early stage of the stream of scene data signals, and generating the caption based on the scene features by using the caption decoder according to the timing.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.