Patent · US Active

Dynamic resource allocation aided by reinforcement learning

US11616736B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventor

Key dates

Filing dateDec 17, 2020
Grant dateMar 28, 2023
Priority date
Expiry dateApr 12, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/088
  • WIPO fieldDigital communication
  • WIPO sectorElectrical engineering

Abstract

A communication system in which DRA control is aided by RL. An example embodiment may control one or more buffer queues populated by downstream and/or upstream data streams. The egress rates of the buffer queues can be dynamically controlled using an RL technique, according to which a learning agent can adaptively change the state-to-action mapping function of the DRA controller while circumventing the RL exploration phase and relying on extrapolation of the already taken actions instead. This feature may result in at least two benefits: (i) cancellation of a performance penalty typically associated with RL exploration; and (ii) faster learning of the environment, as the learning agent can determine the performance metrics of many actions per state in a single occurrence of the state. In an example embodiment, the communication system may be a DSL system, a PON system, or a wireless communication system.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.