Learning Negotiating Behavior Between Cars in Intersections using Deep Q-Learning
Tommy Tram, Anton Jansson, Robin Gr\"onberg, Mohammad Ali, Jonas, Sj\"oberg

TL;DR
This paper develops a deep reinforcement learning approach for automated vehicles to negotiate intersections, achieving high safety and efficiency by learning to adapt to other vehicles' behaviors.
Contribution
It introduces a deep Q-learning based policy for intersection negotiation that accounts for vehicle behaviors and intentions, improving safety and decision-making.
Findings
Achieves 98% collision avoidance rate in simulations.
Deep Recurrent Q-Network outperforms Deep Q-learning in intention inference.
Effective policy adapts speed to pass intersections efficiently.
Abstract
This paper concerns automated vehicles negotiating with other vehicles, typically human driven, in crossings with the goal to find a decision algorithm by learning typical behaviors of other vehicles. The vehicle observes distance and speed of vehicles on the intersecting road and use a policy that adapts its speed along its pre-defined trajectory to pass the crossing efficiently. Deep Q-learning is used on simulated traffic with different predefined driver behaviors and intentions. The results show a policy that is able to cross the intersection avoiding collision with other vehicles 98% of the time, while at the same time not being too passive. Moreover, inferring information over time is important to distinguish between different intentions and is shown by comparing the collision rate between a Deep Recurrent Q-Network at 0.85% and a Deep Q-learning at 1.75%.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Traffic control and management · Adversarial Robustness in Machine Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Q-Learning
