Patent · US Active

Video retrieval techniques using video contrastive learning

US12277171B2 · kind B2 · utility

Cited by: 0
References: 1
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 7, 2023
Grant date: Apr 15, 2025
Priority date
Expiry date: Jun 14, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/19093
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method, computer system, and a computer program product are provided for training a neural network for finding queried videos. Two pairs of video clips and associated text are obtained from a first dataset and a second dataset. The first dataset is used to train two video encoders by providing the video clips to the encoders as input and providing the outputs to a cosine similarity calculator. The second dataset is used to train a multi-mentor paradigm with two mentors. A first mentor and a second mentor are each provided the pair of textual data inputs. The first mentor provides a similarity value comparison, and the second mentor provides a word mover distance. Using the output from the multi-mentor paradigm and the encoders, a contrastive loss is calculated and used to provide contrastive learning of video features by differentiating similarity and dissimilarity of the video clips.
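
The data flow in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how the two mentor outputs are combined or which contrastive loss is used, so the averaging rule in `mentor_target`, the soft-label margin loss in `contrastive_loss`, and the parameters `wmd_scale` and `margin` are all assumptions made here for illustration. The embeddings are plain lists standing in for the outputs of the two video encoders.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two embedding vectors (the encoder outputs)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_u * norm_v)


def mentor_target(text_similarity, word_mover_distance, wmd_scale=1.0):
    """Combine the two mentor outputs into one soft label in [0, 1].

    Assumption: the word mover distance is mapped to a similarity with
    1 / (1 + d / wmd_scale) and averaged with the first mentor's score.
    The abstract does not specify this combination rule.
    """
    wmd_similarity = 1.0 / (1.0 + word_mover_distance / wmd_scale)
    return 0.5 * (text_similarity + wmd_similarity)


def contrastive_loss(video_similarity, target, margin=0.5):
    """Soft-label contrastive loss (an assumed, generic form).

    A high target pulls the two clip embeddings together; a low target
    penalizes video similarity above the margin.
    """
    positive_term = target * (1.0 - video_similarity) ** 2
    negative_term = (1.0 - target) * max(0.0, video_similarity - margin) ** 2
    return positive_term + negative_term


# Hypothetical encoder outputs for a pair of clips and mentor scores
# for their associated text.
clip_a = [0.2, 0.9, 0.1]
clip_b = [0.25, 0.8, 0.05]
target = mentor_target(text_similarity=0.9, word_mover_distance=0.3)
loss = contrastive_loss(cosine_similarity(clip_a, clip_b), target)
```

In a real training setup the loss would be backpropagated through the video encoders; here it is only computed, to show how the text-side mentor signals could supervise the video-side similarity.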

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.