Patent · US Active

Multi-agent coordination method and apparatus

US11948079B2 · kind B2 · utility

0Cited by
1References
18Claims
0Family size

Inventors

Key dates

Filing dateOct 19, 2020
Grant dateApr 2, 2024
Priority date
Expiry dateJul 9, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N7/01
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The present disclosure discloses a multi-agent coordination method. The method includes: performing multiple data collections on N agents to collect E sets of data, where N and E are integers greater than 1; and optimizing neural networks of the N agents using reinforcement learning based on the E sets of data. Each data collection includes: randomly selecting a first coordination pattern from multiple predetermined coordination patterns; obtaining N observations after the N agents act on an environment in the first coordination pattern; determining a first probability and a second probability that a current coordination pattern is the first coordination pattern based on the N observations; and determining a pseudo reward based on the first probability and the second probability. The E sets of data include: a first coordination pattern label indicating the first coordination pattern, the N observations, and the pseudo reward.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.