Patent · US Active

Computer-implemented methods for machine learning model based spatial-temporal adaptive shift for end-to-end text-video retrieval

US12216709B1 · kind B1 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 22, 2023
Grant dateFeb 4, 2025
Priority date
Expiry dateAug 2, 2043

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04N21/232
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for performing a machine learning model based spatial-temporal adaptive shift for end-to-end text-video retrieval are described. According to some examples, a computer-implemented method includes receiving a video comprising a plurality of frames at a content delivery service; generating, by the content delivery service, a set of embeddings for each of a plurality of sections of each frame of the plurality of frames; determining, by a candidate selector machine learning model of the content delivery service, a proper subset of the plurality of sections of each frame of the plurality of frames for a time shift based on the set of embeddings; time shifting, by the content delivery service, the proper subset of the plurality of sections of each frame of the plurality of frames to generate time shifted frames; generating, by the content delivery service, an updated set of embeddings based on the time shifted frames; receiving a search request comprising input text from a user device; determining the video is a match for the search request based on the input text and the updated set of embeddings for the time shifted frames; and sending the video to the user device based on t…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.