BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios

Liyang Yu; Tianyi Wang; Junfeng Jiao; Fengwu Shan; Hongqing Chu; Bingzhao Gao

arXiv:2506.16546·cs.RO·June 23, 2025

BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios

Liyang Yu, Tianyi Wang, Junfeng Jiao, Fengwu Shan, Hongqing Chu, Bingzhao Gao

PDF

Open Access

TL;DR

This paper introduces BIDA, a novel bi-level decision-making algorithm for autonomous vehicles that combines interactive Monte Carlo tree search with deep reinforcement learning to improve safety, efficiency, and interaction in dynamic traffic scenarios.

Contribution

The paper presents a new bi-level interaction decision-making framework integrating MCTS and DRL, specifically designed for complex traffic environments, which outperforms existing benchmarks.

Findings

01

Enhances interaction rationality and safety in AV decision-making.

02

Reduces computational costs compared to existing methods.

03

Outperforms benchmarks in safety, efficiency, and interaction in traffic scenarios.

Abstract

In complex real-world traffic environments, autonomous vehicles (AVs) need to interact with other traffic participants while making real-time and safety-critical decisions accordingly. The unpredictability of human behaviors poses significant challenges, particularly in dynamic scenarios, such as multi-lane highways and unsignalized T-intersections. To address this gap, we design a bi-level interaction decision-making algorithm (BIDA) that integrates interactive Monte Carlo tree search (MCTS) with deep reinforcement learning (DRL), aiming to enhance interaction rationality, efficiency and safety of AVs in dynamic key traffic scenarios. Specifically, we adopt three types of DRL algorithms to construct a reliable value network and policy network, which guide the online deduction process of interactive MCTS by assisting in value update and node selection. Then, a dynamic trajectory planner…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics

MethodsProximal Policy Optimization · ADaptive gradient method with the OPTimal convergence rate · CARLA: An Open Urban Driving Simulator