Patent · US Active

Dynamic resource allocation aided by reinforcement learning

US11616736B2 · kind B2 · utility

0Cited by

1References

20Claims

0Family size

Assignee

NOKIA SOLUTIONS AND NETWORKS OY · FI

Inventor

Paschalis Tsiaflakis · Schriek, BE

Key dates

Filing date	Dec 17, 2020
Grant date	Mar 28, 2023
Priority date	—
Expiry date	Apr 12, 2041

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/088
WIPO fieldDigital communication
WIPO sectorElectrical engineering

Abstract

A communication system in which DRA control is aided by RL. An example embodiment may control one or more buffer queues populated by downstream and/or upstream data streams. The egress rates of the buffer queues can be dynamically controlled using an RL technique, according to which a learning agent can adaptively change the state-to-action mapping function of the DRA controller while circumventing the RL exploration phase and relying on extrapolation of the already taken actions instead. This feature may result in at least two benefits: (i) cancellation of a performance penalty typically associated with RL exploration; and (ii) faster learning of the environment, as the learning agent can determine the performance metrics of many actions per state in a single occurrence of the state. In an example embodiment, the communication system may be a DSL system, a PON system, or a wireless communication system.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.