Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections

Pankaj Kumar; Aditya Mishra; Pranamesh Chakraborty; Subrahmanya Swamy Peruru

arXiv:2505.08896·cs.AI·May 15, 2025

Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections

Pankaj Kumar, Aditya Mishra, Pranamesh Chakraborty, Subrahmanya Swamy Peruru

PDF

Open Access

TL;DR

This paper introduces a deep reinforcement learning approach for autonomous vehicle control at signalized intersections, enhancing safety, efficiency, and comfort through novel reward functions and algorithm integration.

Contribution

It develops a DRL-based control strategy using DDPG and SAC algorithms with a comprehensive reward function tailored for intersection scenarios, trained on real and simulated data.

Findings

01

RL models maintain lower distance headway and jerk compared to human drivers.

02

Both models handle safety-critical scenarios effectively.

03

DDPG produces smoother actions than SAC.

Abstract

Developing an autonomous vehicle control strategy for signalised intersections (SI) is one of the challenging tasks due to its inherently complex decision-making process. This study proposes a Deep Reinforcement Learning (DRL) based longitudinal vehicle control strategy at SI. A comprehensive reward function has been formulated with a particular focus on (i) distance headway-based efficiency reward, (ii) decision-making criteria during amber light, and (iii) asymmetric acceleration/ deceleration response, along with the traditional safety and comfort criteria. This reward function has been incorporated with two popular DRL algorithms, Deep Deterministic Policy Gradient (DDPG) and Soft-Actor Critic (SAC), which can handle the continuous action space of acceleration/deceleration. The proposed models have been trained on the combination of real-world leader vehicle (LV) trajectories and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Traffic Prediction and Management Techniques · Vehicle emissions and performance

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Weight Decay · Adam · Experience Replay · Average Pooling · Dense Connections · Focus · Deep Deterministic Policy Gradient · Global Average Pooling