Patent · US Active

Reinforcement learning by sharing individual data within dynamic groups

US12026610B2 · kind B2 · utility

Cited by	0

References	3

Claims	20

Family size	0

Assignee

International Business Machines Corporation · US

Inventors

Chun Yang Ma · Beijing, CN
Zhi Hu Wang · Beijing, CN
Shiwan Zhao · Beijing, CN
Li Zhang · Yorktown Heights, US

Key dates

Filing date	Sep 25, 2018
Grant date	Jul 2, 2024
Priority date	—
Expiry date	Feb 7, 2042

Classification

Technology area (CPC G)	Physics
CPC primary	G05D1/227
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Methods and systems for reinforcement learning with dynamic agent grouping include gathering information at a first agent using one or more sensors. Shared information is received at the first agent from a second agent. An agent model is trained at the first agent using the gathered information and the shared information. A contribution of the shared information is weighted according to a degree of similarity between the first agent and the second agent. An action is generated using the trained agent model responsive to the gathered information.
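
The weighting step described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the abstract does not name a learning algorithm or a similarity measure, so the tabular Q-learning update, the cosine similarity, and every identifier below (Agent, profile, train, act) are assumptions chosen for illustration. Sensor input is stood in for by hand-written transitions, and the dynamic grouping itself is not modelled.

```python
import math
from collections import defaultdict


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors, clipped to [0, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return max(0.0, dot / norm) if norm else 0.0


class Agent:
    """Hypothetical agent; tabular Q-learning is an assumed stand-in for the agent model."""

    def __init__(self, profile, actions, lr=0.5, gamma=0.9):
        self.profile = profile        # assumed descriptor used to compare agents
        self.actions = actions
        self.lr, self.gamma = lr, gamma
        self.q = defaultdict(float)   # (state, action) -> value estimate

    def update(self, transition, weight=1.0):
        """One Q-learning step, with the step size scaled by `weight`."""
        state, action, reward, next_state = transition
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += weight * self.lr * (target - self.q[(state, action)])

    def train(self, gathered, shared_by_peer):
        """Train on own data at full weight and on shared data weighted by similarity."""
        for transition in gathered:
            self.update(transition)
        for peer, transitions in shared_by_peer:
            weight = cosine_similarity(self.profile, peer.profile)
            for transition in transitions:
                self.update(transition, weight=weight)

    def act(self, state):
        """Pick the action with the highest learned value for `state`."""
        return max(self.actions, key=lambda a: self.q[(state, a)])


actions = ["left", "right"]
first = Agent([1.0, 0.0], actions)
similar_peer = Agent([1.0, 0.0], actions)      # same profile, similarity 1
dissimilar_peer = Agent([0.0, 1.0], actions)   # orthogonal profile, similarity 0

gathered = [("s0", "left", 0.2, "end")]        # transitions are (state, action, reward, next_state)
shared = [
    (similar_peer, [("s0", "right", 1.0, "end")]),
    (dissimilar_peer, [("s0", "left", 5.0, "end")]),
]
first.train(gathered, shared)
chosen = first.act("s0")
```

In this toy setup the transition shared by the dissimilar peer has no effect on the first agent's estimates, while the one from the similar peer counts as much as the agent's own data, so the chosen action follows the similar peer's experience.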

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.