Patent · US Active

Meta-Q learning

US12217137B1 · kind B1 · utility

Cited by: 1
References: 5
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 30, 2020
Grant date: Feb 4, 2025
Priority date
Expiry date: Apr 21, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/006
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques for Meta-Q-Learning (MQL) are described. A method of MQL may include receiving a request from an agent to perform adaptation based at least on task data associated with a new task collected by the agent, identifying a subset of meta-training data corresponding to the task data in a replay buffer, and adapting a policy using the subset of meta-training data and the task data to generate an adapted policy, wherein the adapted policy is used to identify a next action for the agent to perform.
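
The three steps named in the abstract (identify a matching subset of the replay buffer, adapt the policy on that subset plus the new task data, pick the next action) can be illustrated with a minimal Python sketch. The abstract does not say how the subset is identified or how the policy is adapted, so everything here is an assumption made for illustration: the `Transition` record, the state-overlap matching rule in `select_subset`, the tabular Q-learning update in `adapt_policy`, and the greedy `next_action` helper are hypothetical and are not taken from the patent's claims.

```python
from collections import defaultdict
from typing import NamedTuple


class Transition(NamedTuple):
    """One step of experience (hypothetical record layout)."""
    state: int
    action: int
    reward: float
    next_state: int


def select_subset(replay_buffer, task_data):
    """Assumed matching rule: keep meta-training transitions whose state
    also appears in the data collected for the new task."""
    task_states = {t.state for t in task_data}
    return [t for t in replay_buffer if t.state in task_states]


def adapt_policy(q, replay_buffer, task_data, n_actions,
                 alpha=0.5, gamma=0.9, epochs=20):
    """Return an adapted copy of the Q-table after tabular Q-learning
    updates on the matching subset plus the new task data (assumed update)."""
    adapted = defaultdict(float, q)  # copy; unseen (state, action) pairs default to 0.0
    batch = select_subset(replay_buffer, task_data) + list(task_data)
    for _ in range(epochs):
        for s, a, r, s2 in batch:
            best_next = max(adapted[(s2, b)] for b in range(n_actions))
            adapted[(s, a)] += alpha * (r + gamma * best_next - adapted[(s, a)])
    return adapted


def next_action(q, state, n_actions):
    """Greedy action under the adapted Q-table."""
    return max(range(n_actions), key=lambda a: q[(state, a)])


# Usage: one buffer transition matches the new task's state, one does not.
replay_buffer = [Transition(0, 1, 1.0, 1), Transition(5, 0, 10.0, 5)]
task_data = [Transition(0, 0, 0.0, 0)]
adapted_q = adapt_policy({}, replay_buffer, task_data, n_actions=2)
action = next_action(adapted_q, state=0, n_actions=2)
```

In this toy setup only the buffer transition recorded in state 0 is reused, so the unrelated high-reward transition in state 5 does not influence the adapted Q-table.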

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.