BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios
Liyang Yu, Tianyi Wang, Junfeng Jiao, Fengwu Shan, Hongqing Chu, Bingzhao Gao

TL;DR
This paper introduces BIDA, a novel bi-level decision-making algorithm for autonomous vehicles that combines interactive Monte Carlo tree search with deep reinforcement learning to improve safety, efficiency, and interaction in dynamic traffic scenarios.
Contribution
The paper presents a new bi-level interaction decision-making framework integrating MCTS and DRL, specifically designed for complex traffic environments, which outperforms existing benchmarks.
Findings
Enhances interaction rationality and safety in AV decision-making.
Reduces computational costs compared to existing methods.
Outperforms benchmarks in safety, efficiency, and interaction in traffic scenarios.
Abstract
In complex real-world traffic environments, autonomous vehicles (AVs) need to interact with other traffic participants while making real-time and safety-critical decisions accordingly. The unpredictability of human behaviors poses significant challenges, particularly in dynamic scenarios, such as multi-lane highways and unsignalized T-intersections. To address this gap, we design a bi-level interaction decision-making algorithm (BIDA) that integrates interactive Monte Carlo tree search (MCTS) with deep reinforcement learning (DRL), aiming to enhance interaction rationality, efficiency and safety of AVs in dynamic key traffic scenarios. Specifically, we adopt three types of DRL algorithms to construct a reliable value network and policy network, which guide the online deduction process of interactive MCTS by assisting in value update and node selection. Then, a dynamic trajectory planner…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics
MethodsProximal Policy Optimization · ADaptive gradient method with the OPTimal convergence rate · CARLA: An Open Urban Driving Simulator
