Generating reinforcement learning data that is compatible with reinforcement learning for a robotic task
US11610153B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 30, 2019 |
| Grant date | Mar 21, 2023 |
| Priority date | — |
| Expiry date | Jul 18, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG05B2219/40499
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Utilizing at least one existing policy (e.g. a manually engineered policy) for a robotic task, in generating reinforcement learning (RL) data that can be used in training an RL policy for an instance of RL of the robotic task. The existing policy can be one that, standing alone, will not generate data that is compatible with the instance of RL for the robotic task. In contrast, the generated RL data is compatible with RL for the robotic task at least by virtue of it including state data that is in a state space of the RL for the robotic task, and including actions that are in the action space of the RL for the robotic task. The generated RL data can be used in at least some of the initial training for the RL policy using reinforcement learning.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.