Patent · US Active

Video caption generating method and apparatus, device, and storage medium

US11743551B2 · kind B2 · utility

0Cited by
9References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 24, 2021
Grant dateAug 29, 2023
Priority date
Expiry dateDec 7, 2041

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04N21/8549
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

A video caption generating method is provided to a computer device. The method includes encoding a target video by using an encoder of a video caption generating model, to obtain a target visual feature of the target video, decoding the target visual feature by using a basic decoder of the video caption generating model, to obtain a first selection probability corresponding to a candidate word, decoding the target visual feature by using an auxiliary decoder of the video caption generating model, to obtain a second selection probability corresponding to the candidate word, a memory structure of the auxiliary decoder including reference visual context information corresponding to the candidate word, determining a decoded word in the candidate word according to the first selection probability and the second selection probability, and generating a video caption according to decoded word.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.