Large Language Model based Interactive Decision-Making for Autonomous Driving

Xinwei Dong; Jiyang Li; Jiabin Xie; Yang Yi; Tianshang Jia; Shiyu Fang; Ye Tian; Peng Hang

arXiv:2604.23513·cs.RO·April 28, 2026

TL;DR

This paper introduces a Large Language Model-based framework for autonomous driving that enhances scene understanding, intent reasoning, and human-like communication to improve safety and acceptance in complex traffic scenarios.

Contribution

The novel integration of semantic scene modeling with Large Language Models enables interactive, intent-aware decision-making and communication in autonomous driving systems.

Findings

1. Outperforms traditional methods on safety, comfort, and efficiency metrics.

2. Achieves high human-likeness in decision-making in a Turing-test-style evaluation.

3. Demonstrates practical viability in a cluster driving simulator.

Abstract

In high-conflict mixed-traffic scenarios involving human-driven and autonomous vehicles, most existing autonomous driving systems default to overly conservative behaviors, lack proactive interaction, and consequently suffer from limited public acceptance. To mitigate intent misunderstandings and decision failures, we present a Large Language Model based interactive decision-making framework that augments scene understanding and intent-aware interaction to jointly improve safety and efficiency. The approach uses Object-Process Methodology to semantically model complex multi-vehicle scenes, abstracting low-level perceptual data into objects, processes, and relations, thereby streamlining reasoning over latent causal structure. Building on this representation, the Large Language Model parses both explicit and implicit intents of surrounding agents and, under jointly enforced safety and…
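
To make the scene-abstraction step more concrete, the following is a minimal sketch of how a multi-vehicle scene might be expressed as objects, processes, and relations and then rendered as text that a language model could reason over. It is not the authors' implementation: the class names, fields, the `describe_scene` helper, and the example intersection scene are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


# Hypothetical schema: the paper's actual Object-Process Methodology
# encoding is not given in the abstract.
@dataclass
class SceneObject:
    name: str   # e.g. a vehicle identifier
    state: str  # current state of the object


@dataclass
class SceneProcess:
    name: str    # what is happening
    agent: str   # object performing the process
    target: str  # object affected by the process


@dataclass
class SceneRelation:
    source: str
    kind: str    # free-text relation label
    target: str


def describe_scene(objects, processes, relations):
    """Render the semantic scene model as plain sentences, one per line."""
    lines = []
    for obj in objects:
        lines.append(f"Object '{obj.name}' is in state '{obj.state}'.")
    for proc in processes:
        lines.append(
            f"Process '{proc.name}' is performed by '{proc.agent}' "
            f"and affects '{proc.target}'."
        )
    for rel in relations:
        lines.append(f"Relation: '{rel.source}' {rel.kind} '{rel.target}'.")
    return "\n".join(lines)


# Illustrative scene (made up for this sketch, not taken from the paper).
objects = [
    SceneObject("ego_vehicle", "approaching intersection"),
    SceneObject("hdv_1", "decelerating"),
]
processes = [SceneProcess("unprotected_left_turn", "ego_vehicle", "hdv_1")]
relations = [SceneRelation("hdv_1", "has right of way over", "ego_vehicle")]

scene_text = describe_scene(objects, processes, relations)
print(scene_text)
```

A text rendering of this kind could be placed in a prompt ahead of the intent-reasoning step. How the framework actually serializes the scene for the model is not stated in the abstract.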
