Patent · US Active

Method and system for adaptive cycle-level traffic signal control

US11783702B2 · kind B2 · utility

Cited by	1
References	1
Claims	20
Family size	0

Assignee

HUAWEI CLOUD COMPUTING TECHNOLOGIES CO., LTD. · CN

Inventors

Soheil MOHAMAD ALIZADEH SHABESTARY · Toronto, CA
Baher Abdulhai · Toronto, CA
Hao Hai Ma · Markham, CA
Yi-Jing Huo · Gongguan, TW

Key dates

Filing date	May 21, 2021
Grant date	Oct 10, 2023
Priority date	—
Expiry date	Sep 19, 2041

Classification

Technology area (CPC G)	Physics
CPC primary	G06N7/01
WIPO field	Control
WIPO sector	Instruments

Abstract

Methods, systems, and processor-readable media for adaptive cycle-level traffic signal control are described. The adaptive cycle-level traffic signal controller and control method operate within a continuous action space. A reinforcement learning algorithm called Proximal Policy Optimization (PPO), which is a type of actor-critic model for reinforcement learning, may be used to generate signal cycle phase durations selected from a continuous range of values. The controller thus does not treat the action space as discrete, but instead produces continuous values as output. The generated phase durations may together define a full traffic signal cycle. The inputs to the controller may indicate current and past states of the traffic environment. The average delay of vehicles in the traffic environment may be used to calculate the reward for the reinforcement learning model that drives the behavior of the controller.
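The two ideas in the abstract that lend themselves to a concrete illustration are the continuous action space and the delay-based reward. The sketch below is not taken from the patent: the function names, the sigmoid squashing, the 5 to 60 second bounds, and the choice of negative average delay as the reward are all assumptions made for illustration. It shows how unbounded outputs of a policy network could be mapped to one duration per phase, and how a reward could be derived from per-vehicle delays.

```python
import math
from typing import List, Sequence


def phase_durations(
    raw_actions: Sequence[float],
    min_green: float = 5.0,   # assumed lower bound in seconds (hypothetical)
    max_green: float = 60.0,  # assumed upper bound in seconds (hypothetical)
) -> List[float]:
    """Map unbounded actor outputs to continuous phase durations.

    Each raw value is squashed into (0, 1) with a sigmoid and then scaled
    into [min_green, max_green]. The result is one duration per phase, so
    the list as a whole describes one full signal cycle.
    """
    durations = []
    for a in raw_actions:
        # 0.5 * (tanh(a / 2) + 1) equals the sigmoid of a and avoids overflow.
        squashed = 0.5 * (math.tanh(a / 2.0) + 1.0)
        durations.append(min_green + squashed * (max_green - min_green))
    return durations


def delay_reward(vehicle_delays: Sequence[float]) -> float:
    """Return the negative average vehicle delay as a reward.

    The sign convention is an assumption: less delay gives a higher reward.
    An empty observation (no vehicles) yields a reward of 0.0.
    """
    if not vehicle_delays:
        return 0.0
    return -sum(vehicle_delays) / len(vehicle_delays)


if __name__ == "__main__":
    # Four raw outputs, one per phase of a hypothetical four-phase cycle.
    cycle = phase_durations([-1.2, 0.0, 0.7, 2.5])
    print("phase durations (s):", [round(d, 1) for d in cycle])
    print("cycle length (s):", round(sum(cycle), 1))
    print("reward:", delay_reward([12.0, 30.5, 4.0]))
```

In a PPO setup the raw values would typically be sampled from a distribution parameterized by the actor network, with the critic estimating the value of the observed traffic state; that training loop is omitted here.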

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.