Patent · US Active

Removing unnecessary history from reinforcement learning state

US11928556B2 · kind B2 · utility

0Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 29, 2018
Grant dateMar 12, 2024
Priority date
Expiry dateJun 4, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N7/01
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and systems for a reinforcement learning system. A spatial and temporal representation of an observed state of an environment is encoded. A previous state is estimated from a given state and a size of a reward is adjusted based on a difference between the estimated previous state and the previous state.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.