Patent · US Active

Alignment of video and textual sequences for metadata analysis

US10956685B2 · kind B2 · utility

Cited by: 0
References: 16
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 10, 2020
Grant date: Mar 23, 2021
Priority date
Expiry date: Feb 10, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/2276
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Systems, methods and computer program products related to aligning heterogeneous sequential data are disclosed. Video data in a media presentation and textual data corresponding to content of the media presentation are received. An action related to aligning the video data and the textual data is determined using an alignment neural network, such that the video data and the textual data are at least partially aligned following the action. The alignment neural network includes a first fully connected layer that receives as input the video data, the textual data, and data relating to a previously determined action by the alignment neural network related to aligning the video data and the textual data. The determined action related to aligning the video data and the textual data is performed.
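
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the action set (match, skip_video, skip_text), the layer sizes, the random untrained weights, and every class and function name below are assumptions made for illustration only. The one detail taken from the abstract is that the first fully connected layer receives the video data, the textual data, and the previously determined action together as input.

```python
import math
import random

# Hypothetical action set; the abstract does not enumerate the actions.
ACTIONS = ("match", "skip_video", "skip_text")


def one_hot(index, size):
    """Encode the previously determined action as a one-hot vector."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def softmax(scores):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class FullyConnected:
    """Dense layer W x + b with optional ReLU; weights are random, not trained."""

    def __init__(self, n_in, n_out, rng, relu=True):
        scale = 1.0 / math.sqrt(n_in)
        self.weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
                        for _ in range(n_out)]
        self.bias = [0.0] * n_out
        self.relu = relu

    def __call__(self, x):
        out = [sum(w * v for w, v in zip(row, x)) + b
               for row, b in zip(self.weights, self.bias)]
        return [max(0.0, v) for v in out] if self.relu else out


class AlignmentNet:
    """Toy network: first layer sees video, text, and the previous action."""

    def __init__(self, video_dim, text_dim, hidden_dim=8, seed=0):
        rng = random.Random(seed)
        n_in = video_dim + text_dim + len(ACTIONS)
        self.fc1 = FullyConnected(n_in, hidden_dim, rng)
        self.out = FullyConnected(hidden_dim, len(ACTIONS), rng, relu=False)

    def action_probs(self, video_feat, text_feat, prev_action):
        # Concatenate the three inputs named in the abstract.
        x = list(video_feat) + list(text_feat) + one_hot(prev_action, len(ACTIONS))
        return softmax(self.out(self.fc1(x)))


def align(net, video_feats, text_feats):
    """Greedily determine and perform one action per step until a sequence ends."""
    v = t = 0
    prev = 0  # assumption: treat "match" as the initial previous action
    pairs = []
    while v < len(video_feats) and t < len(text_feats):
        probs = net.action_probs(video_feats[v], text_feats[t], prev)
        prev = max(range(len(ACTIONS)), key=probs.__getitem__)
        if ACTIONS[prev] == "match":
            pairs.append((v, t))  # align current clip with current sentence
            v += 1
            t += 1
        elif ACTIONS[prev] == "skip_video":
            v += 1  # leave the current clip unaligned
        else:
            t += 1  # leave the current sentence unaligned
    return pairs
```

Calling align(AlignmentNet(video_dim=4, text_dim=3), clips, sentences) with lists of 4-dimensional clip features and 3-dimensional sentence features returns a list of (clip index, sentence index) pairs. Because the weights are untrained, the pairs carry no meaning; the sketch only shows how each decision is conditioned on the previous one.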

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.