Patent · US Active

Multi-agent coordination method and apparatus

US11948079B2 · kind B2 · utility

Cited by	0

References	1

Claims	18

Family size	0

Inventors

Xiangyang Ji · Longbeilingcun, CN
Shuncheng He · Beijing, CN

Key dates

Filing date	Oct 19, 2020
Grant date	Apr 2, 2024
Priority date	—
Expiry date	Jul 9, 2042

Classification

Technology area (CPC G)	Physics
CPC primary	G06N7/01
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

The present disclosure discloses a multi-agent coordination method. The method includes: performing multiple data collections on N agents to collect E sets of data, where N and E are integers greater than 1; and optimizing neural networks of the N agents using reinforcement learning based on the E sets of data. Each data collection includes: randomly selecting a first coordination pattern from multiple predetermined coordination patterns; obtaining N observations after the N agents act on an environment in the first coordination pattern; determining a first probability and a second probability that a current coordination pattern is the first coordination pattern based on the N observations; and determining a pseudo reward based on the first probability and the second probability. The E sets of data include: a first coordination pattern label indicating the first coordination pattern, the N observations, and the pseudo reward.
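The data-collection loop described in the abstract can be sketched roughly as follows. This is an illustrative outline only, not the patented implementation. The abstract does not say how the two probabilities are estimated or how they are combined into the pseudo reward, so the estimators (`toy_first`, `toy_second`), the toy environment (`toy_act`), and the log-probability formula in `pseudo_reward` are all assumptions introduced here. The reinforcement-learning optimization step is left out.

```python
import math
import random
from dataclasses import dataclass
from typing import Callable, List

Observation = List[float]


@dataclass
class Sample:
    """One collected set: pattern label, N observations, pseudo reward."""
    pattern_label: int
    observations: List[Observation]
    pseudo_reward: float


def pseudo_reward(p_first: float, p_second: float, num_patterns: int) -> float:
    # ASSUMPTION: the abstract gives no formula. Here each probability is
    # scored by its log-ratio against a uniform prior over the patterns.
    eps = 1e-12
    log_prior = math.log(1.0 / num_patterns)
    return (math.log(p_first + eps) - log_prior) + (math.log(p_second + eps) - log_prior)


def collect_once(
    num_patterns: int,
    act: Callable[[int, random.Random], List[Observation]],
    estimate_first: Callable[[List[Observation], int], float],
    estimate_second: Callable[[List[Observation], int], float],
    rng: random.Random,
) -> Sample:
    label = rng.randrange(num_patterns)       # randomly select a coordination pattern
    observations = act(label, rng)            # N agents act; gather N observations
    p1 = estimate_first(observations, label)  # first probability of the selected pattern
    p2 = estimate_second(observations, label) # second probability of the selected pattern
    return Sample(label, observations, pseudo_reward(p1, p2, num_patterns))


def collect(num_sets, num_patterns, act, estimate_first, estimate_second, rng):
    """Repeat the data collection to obtain E sets of data."""
    return [
        collect_once(num_patterns, act, estimate_first, estimate_second, rng)
        for _ in range(num_sets)
    ]


# --- Hypothetical toy stand-ins, for demonstration only ---
N_AGENTS = 3
K_PATTERNS = 4


def _pattern_probability(value: float, label: int, num_patterns: int) -> float:
    # Toy softmax: the pattern index closest to `value` gets the highest probability.
    scores = [math.exp(-(value - k) ** 2) for k in range(num_patterns)]
    return scores[label] / sum(scores)


def toy_act(label: int, rng: random.Random) -> List[Observation]:
    # Each agent observes the pattern index plus a little Gaussian noise.
    return [[label + rng.gauss(0.0, 0.1)] for _ in range(N_AGENTS)]


def toy_first(observations: List[Observation], label: int) -> float:
    # ASSUMPTION: one estimate computed from all observations pooled together.
    mean = sum(o[0] for o in observations) / len(observations)
    return _pattern_probability(mean, label, K_PATTERNS)


def toy_second(observations: List[Observation], label: int) -> float:
    # ASSUMPTION: an average of per-agent estimates.
    probs = [_pattern_probability(o[0], label, K_PATTERNS) for o in observations]
    return sum(probs) / len(probs)


if __name__ == "__main__":
    data = collect(5, K_PATTERNS, toy_act, toy_first, toy_second, random.Random(0))
    for sample in data:
        print(sample.pattern_label, round(sample.pseudo_reward, 3))
```

In this sketch the pseudo reward is zero when both estimates equal the uniform prior 1/K and grows as the selected pattern becomes easier to identify from the observations. In a real system the stand-in estimators would be replaced by learned models, and the collected samples would feed whichever reinforcement-learning update is used.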

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.