Patent · US Active

Method, device and computer program for producing a strategy for a robot

US11628562B2 · kind B2 · utility

2Cited by
2References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 6, 2020
Grant dateApr 18, 2023
Priority date
Expiry dateJul 16, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG05B2219/40499
  • WIPO fieldControl
  • WIPO sectorInstruments

Abstract

A method for producing a strategy for a robot. The method includes the following steps: initializing the strategy and an episode length; repeated execution of the loop including the following steps: producing a plurality of further strategies as a function of the strategy; applying the plurality of the further strategies for the length of the episode length; ascertaining respectively a cumulative reward, which is obtained in the application of the respective further strategy; updating the strategy as a function of a second plurality of the further strategies that obtained the greatest cumulative rewards. After each execution of the loop, the episode length is increased. A computer program, a device for carrying out the method, and a machine-readable memory element on which the computer program is stored, are also described.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.