Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections
Pankaj Kumar, Aditya Mishra, Pranamesh Chakraborty, Subrahmanya Swamy Peruru

TL;DR
This paper introduces a deep reinforcement learning approach to longitudinal control of automated vehicles at signalised intersections, aiming to improve safety, efficiency, and comfort through a tailored reward function combined with the DDPG and SAC algorithms.
Contribution
It develops a DRL-based control strategy using DDPG and SAC algorithms with a comprehensive reward function tailored for intersection scenarios, trained on real and simulated data.
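To make the idea of a composite reward concrete, the sketch below combines three of the components the abstract lists: a distance headway-based efficiency term, an amber-light decision term, and an asymmetric acceleration/deceleration term. It is an illustration only, not the authors' formulation. Every function name, weight, and threshold (`desired_gap_m`, `comfortable_decel`, the weights tuple) is a hypothetical placeholder, and the amber rule uses the standard constant-deceleration stopping relation v^2 / (2d) as an assumed stand-in for the paper's criteria.

```python
def required_deceleration(speed_mps, dist_to_stop_m):
    """Constant deceleration needed to stop exactly at the stop line: v^2 / (2 d)."""
    if dist_to_stop_m <= 0.0:
        return float("inf") if speed_mps > 0.0 else 0.0
    return speed_mps ** 2 / (2.0 * dist_to_stop_m)


def amber_reward(speed_mps, dist_to_stop_m, accel, comfortable_decel=3.0):
    """Assumed rule: brake if a comfortable stop is feasible, otherwise do not brake."""
    can_stop = required_deceleration(speed_mps, dist_to_stop_m) <= comfortable_decel
    braking = accel < 0.0
    return 1.0 if can_stop == braking else -1.0


def efficiency_reward(gap_m, desired_gap_m=20.0):
    """Zero when the gap to the leader equals the (hypothetical) desired gap, negative otherwise."""
    return -abs(gap_m - desired_gap_m) / desired_gap_m


def asymmetric_accel_penalty(accel, accel_weight=1.0, decel_weight=0.5):
    """Quadratic penalty with a different (assumed) weight for speeding up and slowing down."""
    weight = accel_weight if accel >= 0.0 else decel_weight
    return -weight * accel ** 2


def total_reward(gap_m, speed_mps, dist_to_stop_m, accel, amber,
                 weights=(1.0, 1.0, 0.1)):
    """Weighted sum of the terms; the amber term applies only while the signal is amber."""
    w_eff, w_amber, w_acc = weights
    reward = w_eff * efficiency_reward(gap_m) + w_acc * asymmetric_accel_penalty(accel)
    if amber:
        reward += w_amber * amber_reward(speed_mps, dist_to_stop_m, accel)
    return reward
```

A continuous-action agent such as DDPG or SAC would receive a scalar like `total_reward(...)` at each step, with `accel` being the action it chose. The safety and comfort terms mentioned in the abstract are omitted here for brevity.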
Findings
The RL models maintain a lower distance headway and lower jerk than human drivers.
Both models handle safety-critical scenarios effectively.
DDPG produces smoother actions than SAC.
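The two quantities compared in these findings can be computed from sampled trajectories roughly as follows. This is a minimal sketch under stated assumptions: jerk is approximated by a finite difference of acceleration over a fixed time step `dt`, and distance headway is taken as the difference between leader and follower positions measured along the same axis. The function names and the choice of mean absolute jerk as a summary statistic are illustrative and not taken from the paper.

```python
def jerk_series(accels, dt):
    """Finite-difference jerk (m/s^3) between consecutive acceleration samples."""
    if len(accels) < 2:
        raise ValueError("need at least two acceleration samples")
    return [(a1 - a0) / dt for a0, a1 in zip(accels, accels[1:])]


def mean_abs_jerk(accels, dt):
    """Average jerk magnitude; lower values indicate smoother control."""
    jerks = jerk_series(accels, dt)
    return sum(abs(j) for j in jerks) / len(jerks)


def distance_headway(leader_positions, follower_positions):
    """Per-sample distance (m) from the follower to the leader along the lane."""
    return [lead - follow for lead, follow in zip(leader_positions, follower_positions)]
```

For example, `mean_abs_jerk` could be evaluated on both the agent's and a human driver's acceleration traces over the same leader trajectory to compare smoothness.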
Abstract
Developing an autonomous vehicle control strategy for signalised intersections (SI) is a challenging task due to its inherently complex decision-making process. This study proposes a Deep Reinforcement Learning (DRL) based longitudinal vehicle control strategy at SI. A comprehensive reward function has been formulated with a particular focus on (i) distance headway-based efficiency reward, (ii) decision-making criteria during amber light, and (iii) asymmetric acceleration/deceleration response, along with the traditional safety and comfort criteria. This reward function has been incorporated with two popular DRL algorithms, Deep Deterministic Policy Gradient (DDPG) and Soft Actor-Critic (SAC), which can handle the continuous action space of acceleration/deceleration. The proposed models have been trained on the combination of real-world leader vehicle (LV) trajectories and…
Taxonomy
Topics: Traffic control and management · Traffic Prediction and Management Techniques · Vehicle emissions and performance
Methods: Batch Normalization · Weight Decay · Adam · Experience Replay · Average Pooling · Dense Connections · Focus · Deep Deterministic Policy Gradient · Global Average Pooling
