Patent · US Active

Generating reinforcement learning data that is compatible with reinforcement learning for a robotic task

US11610153B1 · kind B1 · utility

2Cited by
0References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 30, 2019
Grant dateMar 21, 2023
Priority date
Expiry dateJul 18, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG05B2219/40499
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Utilizing at least one existing policy (e.g. a manually engineered policy) for a robotic task, in generating reinforcement learning (RL) data that can be used in training an RL policy for an instance of RL of the robotic task. The existing policy can be one that, standing alone, will not generate data that is compatible with the instance of RL for the robotic task. In contrast, the generated RL data is compatible with RL for the robotic task at least by virtue of it including state data that is in a state space of the RL for the robotic task, and including actions that are in the action space of the RL for the robotic task. The generated RL data can be used in at least some of the initial training for the RL policy using reinforcement learning.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.