Patent · US Active

Environment prediction using reinforcement learning

US10733501B2 · kind B2 · utility

Cited by	3
References	1
Claims	20
Family size	0

Assignee

DeepMind Technologies Limited · GB

Inventors

David Silver · Hitchin, GB
Tom Schaul · London, GB
Matteo Hessel · London, GB
Hado Philip van Hasselt · London, GB

Key dates

Filing date	May 3, 2019
Grant date	Aug 4, 2020
Priority date	—
Expiry date	May 3, 2039

Classification

Technology area (CPC G)	Physics
CPC primary	G06N3/088
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for prediction of an outcome related to an environment. In one aspect, a system comprises a state representation neural network that is configured to: receive an observation characterizing a state of an environment being interacted with by an agent and process the observation to generate an internal state representation of the environment state; a prediction neural network that is configured to receive a current internal state representation of a current environment state and process the current internal state representation to generate a predicted subsequent state representation of a subsequent state of the environment and a predicted reward for the subsequent state; and a value prediction neural network that is configured to receive a current internal state representation of a current environment state and process the current internal state representation to generate a value prediction.
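The three components named in the abstract can be pictured with a small sketch. This is an illustration only, not the patented implementation: the class names, the single-layer architecture, the dimensions, and the `unroll` helper are all assumptions made for the example, and the weights are random rather than trained.

```python
import random
from typing import List, Tuple

Vector = List[float]

def init_layer(rng: random.Random, n_in: int, n_out: int):
    """Hypothetical helper: small random weights and zero biases."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

def linear(weights, bias, x: Vector) -> Vector:
    """Affine map: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, bias)]

def relu(v: Vector) -> Vector:
    return [max(0.0, a) for a in v]

class StateRepresentationNet:
    """Observation -> internal state representation."""
    def __init__(self, rng: random.Random, obs_dim: int, state_dim: int):
        self.w, self.b = init_layer(rng, obs_dim, state_dim)

    def __call__(self, observation: Vector) -> Vector:
        return relu(linear(self.w, self.b, observation))

class PredictionNet:
    """Current internal state -> (predicted subsequent state, predicted reward)."""
    def __init__(self, rng: random.Random, state_dim: int):
        self.w_s, self.b_s = init_layer(rng, state_dim, state_dim)
        self.w_r, self.b_r = init_layer(rng, state_dim, 1)

    def __call__(self, state: Vector) -> Tuple[Vector, float]:
        next_state = relu(linear(self.w_s, self.b_s, state))
        reward = linear(self.w_r, self.b_r, state)[0]
        return next_state, reward

class ValueNet:
    """Current internal state -> scalar value prediction."""
    def __init__(self, rng: random.Random, state_dim: int):
        self.w, self.b = init_layer(rng, state_dim, 1)

    def __call__(self, state: Vector) -> float:
        return linear(self.w, self.b, state)[0]

def unroll(state_net, prediction_net, value_net, observation: Vector, steps: int):
    """Assumed usage: encode once, then step the prediction network
    in internal-state space, collecting predicted rewards."""
    state = state_net(observation)
    rewards = []
    for _ in range(steps):
        state, reward = prediction_net(state)
        rewards.append(reward)
    return rewards, value_net(state)

if __name__ == "__main__":
    rng = random.Random(0)
    state_net = StateRepresentationNet(rng, obs_dim=4, state_dim=3)
    prediction_net = PredictionNet(rng, state_dim=3)
    value_net = ValueNet(rng, state_dim=3)
    rewards, value = unroll(state_net, prediction_net, value_net, [1.0, 0.0, -1.0, 0.5], steps=2)
    print(rewards, value)
```

In this sketch only the state representation network ever sees an observation; the prediction and value networks operate purely on internal state representations, which mirrors the division of inputs described in the abstract.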

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.