Hybrid Car-Following Strategy based on Deep Deterministic Policy   Gradient and Cooperative Adaptive Cruise Control

Ruidong Yan; Rui Jiang; Bin Jia; Jin Huang; and Diange Yang

arXiv:2103.03796·cs.AI·January 12, 2022

Hybrid Car-Following Strategy based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

Ruidong Yan, Rui Jiang, Bin Jia, Jin Huang, and Diange Yang

PDF

TL;DR

This paper introduces a hybrid car-following strategy combining deep reinforcement learning (DDPG) and cooperative adaptive cruise control (CACC) to enhance performance in complex driving environments, addressing limitations of existing methods.

Contribution

It proposes a novel hybrid approach that integrates DDPG and CACC, selecting the best action based on reward to improve car-following performance.

Findings

01

Improved car-following accuracy compared to standalone DDPG and CACC.

02

Enhanced stability and responsiveness in simulated driving scenarios.

03

Effective balance between exploration and rule-based control.

Abstract

Deep deterministic policy gradient (DDPG)-based car-following strategy can break through the constraints of the differential equation model due to the ability of exploration on complex environments. However, the car-following performance of DDPG is usually degraded by unreasonable reward function design, insufficient training, and low sampling efficiency. In order to solve this kind of problem, a hybrid car-following strategy based on DDPG and cooperative adaptive cruise control (CACC) is proposed. First, the car-following process is modeled as the Markov decision process to calculate CACC and DDPG simultaneously at each frame. Given a current state, two actions are obtained from CACC and DDPG, respectively. Then, an optimal action, corresponding to the one offering a larger reward, is chosen as the output of the hybrid strategy. Meanwhile, a rule is designed to ensure that the change…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAdam · Convolution · Weight Decay · Dense Connections · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Experience Replay · Deep Deterministic Policy Gradient