Patent · US Active

Initial results of a reinforcement learning model using a heuristic

US11724194B2 · kind B2 · utility

1Cited by

4References

16Claims

0Family size

Assignee

BLIZZARD ENTERTAINMENT, INC. · US

Inventors

Wayne Yang · Lake Forest, US
David Pendergrast · Irvine, US
Alexander Zook · Irvine, US

Key dates

Filing date	Jul 28, 2021
Grant date	Aug 15, 2023
Priority date	—
Expiry date	Feb 1, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/092
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems and methods for improving initial results of a reinforcement learning model are described herein. In an embodiment, a server computer initiates a reinforcement learning model for a modeled system. While executing the reinforcement learning model, the server computer computes a first result value for a particular action using the reinforcement learning model and a second result value for the particular action using a heuristic separate from the reinforcement model. Based, at least in part, on the first result value for the particular action and the second result value for the particular action, the server computer performs the particular action. The server computer determining a result from performing the particular action and updates the reinforcement learning model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.