Patent · US Active

Spatial and temporal attention-based deep reinforcement learning of hierarchical lane-change policies for controlling an autonomous vehicle

US10940863B2 · kind B2 · utility

5Cited by

2References

20Claims

0Family size

Assignees

GM GLOBAL TECHNOLOGY OPERATIONS LLC · US
CARNEGIE MELLON UNIVERSITY · US

Inventors

Praveen Palanisamy · Sterling Heights, US
Upali P. Mudalige · Oakland Charter Township, US
Yilun Chen · San Diego, US
John M. Dolan · Woodbury, US
Katharina Muelling · Pittsburgh, US

Key dates

Filing date	Nov 1, 2018
Grant date	Mar 9, 2021
Priority date	—
Expiry date	May 8, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/048
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems and methods are provided that employ spatial and temporal attention-based deep reinforcement learning of hierarchical lane-change policies for controlling an autonomous vehicle. An actor-critic network architecture includes an actor network that process image data received from an environment to learn the lane-change policies as a set of hierarchical actions, and a critic network that evaluates the lane-change policies to calculate loss and gradients to predict an action-value function (Q) that is used to drive learning and update parameters of the lane-change policies. The actor-critic network architecture implements a spatial attention module to select relevant regions in the image data that are of importance, and a temporal attention module to learn temporal attention weights to be applied to past frames of image data to indicate relative importance in deciding which lane-change policy to select.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.