Multi-agent policy machine learning
US12355524B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 19, 2021 |
| Grant date | Jul 8, 2025 |
| Priority date | — |
| Expiry date | Jan 19, 2041 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04B7/0619
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
There is disclosed a method of operating a beam-forming wireless communication system, the system has a plurality of radio nodes, an actor neural network being associated to each radio node, wherein further to each actor neural network, there is associated a critic network. The method includes training each actor neural network, for controlling at least one associated radio node, based on learning feedback provided by its associated critic network, the learning feedback being based on operation information provided be the actor neural network for the critic network. The disclosure also pertains to related devices and methods.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.