Patent · US Active

Generating reinforcement learning data that is compatible with reinforcement learning for a robotic task

US11610153B1 · kind B1 · utility

Cited by	2
References	0
Claims	19
Family size	0

Assignee

X Development LLC · US

Inventors

Alexander Herzog · San Jose, US
Adrian Li · San Francisco, US
Mrinal Kalakrishnan · Palo Alto, US
Benjamin Holson · Sunnyvale, US

Key dates

Filing date	Dec 30, 2019
Grant date	Mar 21, 2023
Priority date	—
Expiry date	Jul 18, 2041

Classification

Technology area (CPC G)	Physics
CPC primary	G05B2219/40499
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

At least one existing policy (e.g., a manually engineered policy) for a robotic task is used to generate reinforcement learning (RL) data that can be used in training an RL policy for an instance of RL of the robotic task. The existing policy can be one that, standing alone, will not generate data that is compatible with that instance of RL. In contrast, the generated RL data is compatible with RL for the robotic task at least because it includes state data that is in the state space of the RL for the robotic task and actions that are in the action space of the RL for the robotic task. The generated RL data can be used in at least some of the initial training of the RL policy.
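The compatibility requirement in the abstract can be illustrated with a minimal sketch. It is not the patented method, only one possible reading under stated assumptions: a hypothetical engineered policy (`existing_policy`) commands an absolute 2D target position, while the assumed RL action space accepts only bounded per-step displacements (`MAX_STEP`), and the assumed RL state is the gripper position concatenated with the target position. All names, the 2D setting, and the sparse reward are illustrative assumptions that do not come from the patent record.

```python
from dataclasses import dataclass
from typing import List, Tuple

Vec = Tuple[float, float]
State = Tuple[float, float, float, float]  # (gripper_x, gripper_y, target_x, target_y)

MAX_STEP = 0.05  # assumed bound on each component of an RL action


@dataclass
class Transition:
    """One RL-compatible sample: state and action lie in the assumed RL spaces."""
    state: State
    action: Vec
    reward: float
    next_state: State
    done: bool


def existing_policy(gripper: Vec, target: Vec) -> Vec:
    """Hypothetical engineered policy: outputs an absolute goal position,
    which is NOT a valid action in the assumed RL action space."""
    return target


def clip(value: float, low: float, high: float) -> float:
    return max(low, min(high, value))


def generate_rl_data(start: Vec, target: Vec,
                     tol: float = 1e-6, max_steps: int = 1000) -> List[Transition]:
    """Roll out the existing policy and record each step as an RL transition."""
    gripper = start
    data: List[Transition] = []
    for _ in range(max_steps):
        goal = existing_policy(gripper, target)
        # Map the absolute goal into a bounded displacement (an RL action).
        action = (clip(goal[0] - gripper[0], -MAX_STEP, MAX_STEP),
                  clip(goal[1] - gripper[1], -MAX_STEP, MAX_STEP))
        nxt = (gripper[0] + action[0], gripper[1] + action[1])
        done = abs(nxt[0] - target[0]) <= tol and abs(nxt[1] - target[1]) <= tol
        data.append(Transition(state=gripper + target,
                               action=action,
                               reward=1.0 if done else 0.0,  # assumed sparse reward
                               next_state=nxt + target,
                               done=done))
        gripper = nxt
        if done:
            break
    return data
```

Transitions produced this way could seed a replay buffer for the initial phase of RL training, since every recorded action already respects the assumed action bounds even though the engineered policy itself never emits such actions.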

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.