Removing unnecessary history from reinforcement learning state
US11928556B2 · kind B2 · utility
0Cited by
4References
20Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Dec 29, 2018 |
| Grant date | Mar 12, 2024 |
| Priority date | — |
| Expiry date | Jun 4, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N7/01
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods and systems for a reinforcement learning system. A spatial and temporal representation of an observed state of an environment is encoded. A previous state is estimated from a given state and a size of a reward is adjusted based on a difference between the estimated previous state and the previous state.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.