Patent · US Active

Removing unnecessary history from reinforcement learning state

US11928556B2 · kind B2 · utility

0Cited by

4References

20Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Guy Hadash · Haifa, IL
Boaz Carmeli · Koranit, IL
George Kour · Tel Aviv-Yafo, IL

Key dates

Filing date	Dec 29, 2018
Grant date	Mar 12, 2024
Priority date	—
Expiry date	Jun 4, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG06N7/01
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods and systems for a reinforcement learning system. A spatial and temporal representation of an observed state of an environment is encoded. A previous state is estimated from a given state and a size of a reward is adjusted based on a difference between the estimated previous state and the previous state.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.