Patent · US Active

Low-latency captioning system

US11445267B1 · kind B1 · utility

1Cited by
2References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 23, 2021
Grant dateSep 13, 2022
Priority date
Expiry dateJul 23, 2041

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04N21/44008
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

A scene captioning system is provided. The scene captioning system includes an interface configured to acquire a stream of scene data signals including frames and sound data, a memory to store a computer-executable scene captioning model including a scene encoder, a timing decoder, a timing detector, and a caption decoder, wherein the audio-visual encoder is shared by the timing decoder and the timing detector and the caption decoder, and a processor, in connection with the memory. The processor is configured to perform steps of extracting scene features from the scene data signals by use of the audio-visual encoder, determining a timing of generating a caption by use of the timing detector, wherein the timing is arranged an early stage of the stream of scene data signals, and generating the caption based on the scene features by using the caption decoder according to the timing.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.