Patent · US Active

Using machine learning to train and use a model to perform automatic interface actions based on video and input datasets

US12315255B2 · kind B2 · utility

1Cited by

5References

22Claims

0Family size

Assignee

OpenAI OpCo, LLC · US

Inventors

Bowen Baker · Nevada City, US
Ilge Akkaya · Palo Alto, US
Peter Zhokhov · South San Francisco, US
Joost Huizinga · Emeryville, US
Jie Tang · Ibaraki, JP
Adrien Ecoffet · Burlingame, US
Brandon Houghton · San Francisco, US
Raul Sampedro Gonzalez · San Mateo, US
Jeffrey Michael Clune · San Francisco, US

Key dates

Filing date	Dec 19, 2023
Grant date	May 27, 2025
Priority date	—
Expiry date	Dec 19, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG06V10/82
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed herein are methods, systems, and computer-readable media for training a machine learning model to label unlabeled data and/or perform automated actions. In an embodiment, a method comprises receiving unlabeled digital video data, generating pseudo-labels for the unlabeled digital video data, the generating comprising receiving labeled digital video data, training an inverse dynamics model (IDM) using the labeled digital video data, and generating at least one pseudo-label for the unlabeled digital video data, wherein the at least one pseudo-label is based on a prediction, generated by the IDM, of one or more actions that mimic at least one timestep of the unlabeled digital video data. In some embodiments, the method further comprises adding the at least one pseudo-label to the unlabeled digital video data and further training the IDM or a machine learning model using the pseudo-labeled digital video data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.