Initial results of a reinforcement learning model using a heuristic
US11724194B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 28, 2021 |
| Grant date | Aug 15, 2023 |
| Priority date | — |
| Expiry date | Feb 1, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N3/092
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and methods for improving initial results of a reinforcement learning model are described herein. In an embodiment, a server computer initiates a reinforcement learning model for a modeled system. While executing the reinforcement learning model, the server computer computes a first result value for a particular action using the reinforcement learning model and a second result value for the particular action using a heuristic separate from the reinforcement model. Based, at least in part, on the first result value for the particular action and the second result value for the particular action, the server computer performs the particular action. The server computer determining a result from performing the particular action and updates the reinforcement learning model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.