Patent · US Active

Meta-Q learning

US12217137B1 · kind B1 · utility

1Cited by

5References

20Claims

0Family size

Assignee

AMAZON TECHNOLOGIES, INC. · US

Inventors

Rasool Fakoor · San Jose, US
Alexander Johannes Smola · Sunnyvale, US
Stefano Soatto · Pasadena, US
Pratik Anil Chaudhari · Pasadena, US

Key dates

Filing date	Sep 30, 2020
Grant date	Feb 4, 2025
Priority date	—
Expiry date	Apr 21, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/006
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Techniques for Meta-Q-Learning (MQL) are described. A method of MQL may include receiving a request from an agent to perform adaptation based at least on task data associated with a new task collected by the agent, identifying a subset of meta-training data corresponding to the task data in a replay buffer, and adapting a policy using the subset of meta-training data and the task data to generate an adapted policy, wherein the adapted policy is used identify a next action for the agent to perform.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.