Patent · US Active

Reinforcement learning method, recording medium, and reinforcement learning system

US11543789B2 · kind B2 · utility

Cited by: 0
References: 4
Claims: 13
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 21, 2020
Grant date: Jan 3, 2023
Priority date
Expiry date: Aug 13, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G05B2219/34082
  • WIPO field: Control
  • WIPO sector: Instruments

Abstract

A reinforcement learning method executed by a computer includes calculating a degree of risk for a state of a controlled object at a current time point with respect to a constraint condition related to the state of the controlled object, the degree of risk being calculated based on a predicted value of the state of the controlled object at a future time point, the predicted value being obtained from model information defining a relationship between the state of the controlled object and a control input to the controlled object; and determining the control input to the controlled object at the current time point, from a range defined according to the calculated degree of risk so that the range becomes narrower as the calculated degree of risk increases.
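
The two steps in the abstract (compute a degree of risk from a model-based prediction, then pick the control input from a range that narrows as the risk grows) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the scalar linear model and its coefficients `a` and `b`, the upper-limit constraint `limit`, the risk formula (predicted state divided by the limit, clipped to [0, 1]), the linear shrinking rule, and the uniform sampling. The abstract only requires that the range become narrower as the calculated risk increases.

```python
import random


def predict_next_state(state, control, a=0.9, b=0.5):
    """Hypothetical model information: scalar linear dynamics x[t+1] = a*x[t] + b*u[t]."""
    return a * state + b * control


def degree_of_risk(state, limit, nominal_control=0.0):
    """Assumed risk measure in [0, 1].

    Predicts the state one step ahead under a nominal input and reports how
    close that prediction is to the upper constraint `limit` (assumed > 0).
    """
    predicted = predict_next_state(state, nominal_control)
    return min(max(predicted / limit, 0.0), 1.0)


def input_half_width(risk, max_half_width=2.0):
    """Half-width of the allowed input range; shrinks linearly as risk grows."""
    return max_half_width * (1.0 - risk)


def choose_control(state, limit, rng, nominal_control=0.0):
    """Sample the current control input from the risk-dependent range."""
    risk = degree_of_risk(state, limit, nominal_control)
    half_width = input_half_width(risk)
    return nominal_control + rng.uniform(-half_width, half_width)


if __name__ == "__main__":
    rng = random.Random(0)
    for state in (1.0, 5.0, 10.0, 12.0):
        risk = degree_of_risk(state, limit=10.0)
        control = choose_control(state, limit=10.0, rng=rng)
        print(f"state={state:5.1f} risk={risk:.2f} control={control:+.3f}")
```

In this sketch, a state far from the limit leaves a wide range for exploration, while a state whose predicted value reaches the limit collapses the range to the nominal input alone. A real system would likely use a multi-dimensional model and a risk measure that accounts for prediction error, which the abstract does not detail.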

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.