Human-Like Autonomous Car-Following Model with Deep Reinforcement Learning
Meixin Zhu, Xuesong Wang, Yinhai Wang

TL;DR
This paper introduces a deep reinforcement learning-based model for autonomous car-following that mimics human driving behavior more accurately than traditional models, with continuous learning capabilities and good generalization.
Contribution
It develops a novel deep RL framework for human-like car-following that outperforms existing models in accuracy and adaptability, using real driving data and continuous updates.
Findings
The DDPGvRT model achieves 18% spacing and 5% speed validation errors.
It outperforms traditional and recent data-driven models in accuracy.
The model demonstrates good generalization and adaptability to different drivers.
Abstract
This study proposes a framework for human-like autonomous car-following planning based on deep reinforcement learning (deep RL). Historical driving data are fed into a simulation environment where an RL agent learns from trial and error interactions based on a reward function that signals how much the agent deviates from the empirical data. Through these interactions, an optimal policy, or car-following model that maps in a human-like way from speed, relative speed between a lead and following vehicle, and inter-vehicle spacing to acceleration of a following vehicle is finally obtained. The model can be continuously updated when more data are fed in. Two thousand car-following periods extracted from the 2015 Shanghai Naturalistic Driving Study were used to train the model and compare its performance with that of traditional and recent data-driven car-following models. As shown by this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Transportation Planning and Optimization · Autonomous Vehicle Technology and Safety
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
