Patent · US Active

Using machine learning to train and use a model to perform automatic interface actions based on video and input datasets

US12315255B2 · kind B2 · utility

1Cited by
5References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 19, 2023
Grant dateMay 27, 2025
Priority date
Expiry dateDec 19, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V10/82
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed herein are methods, systems, and computer-readable media for training a machine learning model to label unlabeled data and/or perform automated actions. In an embodiment, a method comprises receiving unlabeled digital video data, generating pseudo-labels for the unlabeled digital video data, the generating comprising receiving labeled digital video data, training an inverse dynamics model (IDM) using the labeled digital video data, and generating at least one pseudo-label for the unlabeled digital video data, wherein the at least one pseudo-label is based on a prediction, generated by the IDM, of one or more actions that mimic at least one timestep of the unlabeled digital video data. In some embodiments, the method further comprises adding the at least one pseudo-label to the unlabeled digital video data and further training the IDM or a machine learning model using the pseudo-labeled digital video data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.