Meta-Q learning
US12217137B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 30, 2020 |
| Grant date | Feb 4, 2025 |
| Priority date | — |
| Expiry date | Apr 21, 2043 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N3/006
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques for Meta-Q-Learning (MQL) are described. A method of MQL may include receiving a request from an agent to perform adaptation based at least on task data associated with a new task collected by the agent, identifying a subset of meta-training data corresponding to the task data in a replay buffer, and adapting a policy using the subset of meta-training data and the task data to generate an adapted policy, wherein the adapted policy is used to identify a next action for the agent to perform.
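The three steps named in the abstract (receive task data, identify the corresponding subset of the replay buffer, adapt the policy) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `Transition` record, the `distance` measure used to decide which meta-training data "corresponds" to the task data, the choice of `k`, and the use of tabular Q-learning as the adaptation step. The abstract does not state how correspondence is determined or how the policy is represented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transition:
    # Hypothetical record layout for one step of experience.
    state: int
    action: int
    reward: float
    next_state: int


def distance(a, b):
    # Assumed similarity measure: 0.0 when two transitions agree exactly.
    mismatch = (a.state != b.state) + (a.action != b.action) + (a.next_state != b.next_state)
    return mismatch + abs(a.reward - b.reward)


def select_subset(replay_buffer, task_data, k):
    # Keep the k meta-training transitions closest to any new-task transition.
    def score(t):
        return min(distance(t, d) for d in task_data)
    return sorted(replay_buffer, key=score)[:k]


def adapt_policy(q, transitions, n_actions, alpha=0.5, gamma=0.9, epochs=20):
    # Tabular Q-learning updates over the given transitions; returns a new table.
    adapted = dict(q)
    for _ in range(epochs):
        for t in transitions:
            best_next = max(adapted.get((t.next_state, a), 0.0) for a in range(n_actions))
            old = adapted.get((t.state, t.action), 0.0)
            adapted[(t.state, t.action)] = old + alpha * (t.reward + gamma * best_next - old)
    return adapted


def next_action(q, state, n_actions):
    # Greedy action under the Q-table; ties resolve to the lowest action index.
    return max(range(n_actions), key=lambda a: q.get((state, a), 0.0))


def handle_adaptation_request(q, replay_buffer, task_data, n_actions, k):
    # Adapt using the selected meta-training subset plus the new-task data.
    subset = select_subset(replay_buffer, task_data, k)
    return adapt_policy(q, subset + task_data, n_actions)


# Toy replay buffer mixing two meta-training tasks in a single-state problem:
# one task rewards action 0, the other rewards action 1.
replay_buffer = [
    Transition(0, 0, 1.0, 0),
    Transition(0, 0, 0.0, 0),
    Transition(0, 1, 0.0, 0),
    Transition(0, 1, 1.0, 0),
]
# Data the agent collected on the new task, which rewards action 1.
task_data = [Transition(0, 0, 0.0, 0), Transition(0, 1, 1.0, 0)]

subset = select_subset(replay_buffer, task_data, k=2)
adapted_q = handle_adaptation_request({}, replay_buffer, task_data, n_actions=2, k=2)
chosen = next_action(adapted_q, state=0, n_actions=2)
```

In this toy setup the selection step keeps only the buffer entries that agree with the new task's observed rewards, so the adaptation step is not pulled toward the conflicting meta-training task. A real system would likely use a learned similarity or weighting scheme and a function approximator in place of the table.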
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.