Patent · US Active

Method and system for adaptive cycle-level traffic signal control

US11783702B2 · kind B2 · utility

1Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 21, 2021
Grant dateOct 10, 2023
Priority date
Expiry dateSep 19, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N7/01
  • WIPO fieldControl
  • WIPO sectorInstruments

Abstract

Methods, systems, and processor-readable media for adaptive cycle-level traffic signal control are described. An adaptive cycle-level traffic signal controller and control method that operate within a continuous action space. A reinforcement learning algorithm called Proximal Policy Optimization (PPO), which is a type of actor-critic model for reinforcement learning, may be used to generate signal cycle phase durations selected from a continuous range of values. The controller thus does not treat the action space as discrete, but instead produces continuous values as output. The generated phase durations may define a full traffic signal cycle. The inputs to the controller may indicate current and past states of the traffic environment. The average duration of delay of vehicles in the traffic environment may be used to calculate the reward for the reinforcement learning model that drives the behavior of the controller.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.