Patent · US Active

Initial results of a reinforcement learning model using a heuristic

US11724194B2 · kind B2 · utility

Cited by: 1
References: 4
Claims: 16
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 28, 2021
Grant date: Aug 15, 2023
Priority date
Expiry date: Feb 1, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/092
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Systems and methods for improving initial results of a reinforcement learning model are described herein. In an embodiment, a server computer initiates a reinforcement learning model for a modeled system. While executing the reinforcement learning model, the server computer computes a first result value for a particular action using the reinforcement learning model and a second result value for the particular action using a heuristic separate from the reinforcement learning model. Based, at least in part, on the first result value for the particular action and the second result value for the particular action, the server computer performs the particular action. The server computer determines a result from performing the particular action and updates the reinforcement learning model.
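
The loop described in the abstract can be illustrated with a minimal Python sketch. This is not the patented method: the abstract does not say how the two result values are combined, so the linear blend below, in which the model's weight grows with the number of times an action has been tried, is purely an assumption. The class name HeuristicAssistedAgent, the warmup parameter, and the heuristic(state, action) callable are likewise hypothetical names introduced only for illustration.

```python
from collections import defaultdict


class HeuristicAssistedAgent:
    """Tabular value learner whose early choices lean on a separate heuristic.

    Illustrative only: the blending rule is an assumption, not taken from the patent.
    """

    def __init__(self, actions, heuristic, learning_rate=0.5, warmup=5):
        self.actions = list(actions)
        self.heuristic = heuristic      # hypothetical callable: (state, action) -> float
        self.learning_rate = learning_rate
        self.warmup = warmup            # assumed: visits after which the model alone decides
        self.q = defaultdict(float)     # learned value per (state, action), starts at 0.0
        self.visits = defaultdict(int)  # how often each (state, action) has been updated

    def model_weight(self, state, action):
        # Assumed rule: trust in the model grows linearly with visits, capped at 1.
        return min(1.0, self.visits[(state, action)] / self.warmup)

    def combined_value(self, state, action):
        w = self.model_weight(state, action)
        first = self.q[(state, action)]         # first result value (learning model)
        second = self.heuristic(state, action)  # second result value (heuristic)
        return w * first + (1.0 - w) * second

    def choose(self, state):
        # Pick the action with the highest blended value; ties go to the earliest action.
        return max(self.actions, key=lambda a: self.combined_value(state, a))

    def update(self, state, action, reward):
        # Move the learned value toward the observed result and record the visit.
        key = (state, action)
        self.visits[key] += 1
        self.q[key] += self.learning_rate * (reward - self.q[key])


def prefer_b(state, action):
    # Hypothetical heuristic that always favours action "b".
    return 1.0 if action == "b" else 0.0


agent = HeuristicAssistedAgent(["a", "b"], prefer_b)
action = agent.choose("s0")        # no experience yet, so the heuristic decides
agent.update("s0", action, -1.0)   # the observed result feeds back into the model
```

Under this assumed blend, the heuristic dominates while an action has little history, and the learned values take over once an action has been tried warmup times. That is one plausible way to read "based, at least in part" on both values.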

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.