Method, device and computer program for producing a strategy for a robot
US11628562B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 6, 2020 |
| Grant date | Apr 18, 2023 |
| Priority date | — |
| Expiry date | Jul 16, 2041 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G05B2219/40499
- WIPO field: Control
- WIPO sector: Instruments
Abstract
A method for producing a strategy for a robot. The method includes the following steps: initializing the strategy and an episode length; and repeatedly executing a loop that includes the following steps: producing a plurality of further strategies as a function of the strategy; applying each of the further strategies for the duration of the episode length; ascertaining, for each further strategy, the cumulative reward obtained when that strategy is applied; and updating the strategy as a function of a second plurality of the further strategies, namely those that obtained the greatest cumulative rewards. After each execution of the loop, the episode length is increased. A computer program, a device for carrying out the method, and a machine-readable memory element on which the computer program is stored are also described.
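The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation; everything the abstract leaves open is an assumption made here for illustration. The strategy is assumed to be a two-parameter vector `theta`, further strategies are assumed to be Gaussian perturbations of it, the update is assumed to be the mean of the `top_k` best candidates, and the episode length is assumed to grow by a fixed increment. The function `cumulative_reward` is a hypothetical toy stand-in for the robot and its environment.

```python
import random


def cumulative_reward(theta, episode_length, target=1.0):
    """Hypothetical toy environment (assumption): a 1-D point steered toward `target`.

    The strategy is linear in the state; the reward at each step is the
    negative squared distance to the target.
    """
    x = 0.0
    total = 0.0
    for _ in range(episode_length):
        action = theta[0] * x + theta[1]
        action = max(-1.0, min(1.0, action))  # keep actions bounded
        x += action
        total -= (x - target) ** 2
    return total


def produce_strategy(iterations=20, population=16, top_k=4, sigma=0.1,
                     initial_length=5, length_increment=5, seed=0):
    """Sketch of the loop from the abstract, under the assumptions stated above."""
    rng = random.Random(seed)
    theta = [0.0, 0.0]               # initialize the strategy
    episode_length = initial_length  # initialize the episode length

    for _ in range(iterations):
        # Produce further strategies as a function of the current strategy
        # (assumption: additive Gaussian noise on each parameter).
        candidates = [[p + sigma * rng.gauss(0.0, 1.0) for p in theta]
                      for _ in range(population)]

        # Apply each candidate for the current episode length and record
        # its cumulative reward.
        scored = [(cumulative_reward(c, episode_length), c) for c in candidates]

        # Keep the candidates with the greatest cumulative rewards.
        scored.sort(key=lambda pair: pair[0], reverse=True)
        elite = [c for _, c in scored[:top_k]]

        # Update the strategy as a function of the elite set
        # (assumption: parameter-wise mean).
        theta = [sum(c[i] for c in elite) / top_k for i in range(len(theta))]

        # Increase the episode length after each execution of the loop.
        episode_length += length_increment

    return theta, episode_length


if __name__ == "__main__":
    strategy, final_length = produce_strategy()
    print("strategy:", strategy)
    print("final episode length:", final_length)
```

Starting with short episodes keeps early evaluations cheap while the strategy is still poor; lengthening them after each iteration spends more evaluation time as the candidates improve. The fixed increment used here is only one possible schedule.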
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.