Patent · US Active

Reinforcement learning method, recording medium, and reinforcement learning system

US11543789B2 · kind B2 · utility

0Cited by

4References

13Claims

0Family size

Assignee

FUJITSU LIMITED · JP

Inventors

Yoshihiro Okawa · Kirishima, JP
Tomotake Sasaki · Kawasaki, JP
Hidenao Iwane · Kawasaki, JP
Hitoshi Yanami · Kawasaki, JP

Key dates

Filing date	Feb 21, 2020
Grant date	Jan 3, 2023
Priority date	—
Expiry date	Aug 13, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG05B2219/34082
WIPO fieldControl
WIPO sectorInstruments

Abstract

A reinforcement learning method executed by a computer includes calculating a degree of risk for a state of a controlled object at a current time point with respect to a constraint condition related to the state of the controlled object, the degree of risk being calculated based on a predicted value of the state of the controlled object at a future time point, the predicted value being obtained from model information defining a relationship between the state of the controlled object and a control input to the controlled object; and determining the control input to the controlled object at the current time point, from a range defined according to the calculated degree of risk so that the range becomes narrower as the calculated degree of risk increases.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.