Large Language Model based Interactive Decision-Making for Autonomous Driving
Xinwei Dong, Jiyang Li, Jiabin Xie, Yang Yi, Tianshang Jia, Shiyu Fang, Ye Tian, Peng Hang

TL;DR
This paper introduces a Large Language Model-based framework for autonomous driving that enhances scene understanding, intent reasoning, and human-like communication to improve safety and acceptance in complex traffic scenarios.
Contribution
The novel integration of semantic scene modeling with Large Language Models enables interactive, intent-aware decision-making and communication in autonomous driving systems.
Findings
Outperforms traditional methods in safety, comfort, and efficiency metrics.
Achieves high human-likeness in decision-making as per Turing-test-style evaluation.
Demonstrates practical viability in a cluster driving simulator.
Abstract
In high-conflict mixed-traffic scenarios involving human-driven and autonomous vehicles, most existing autonomous driving systems default to overly conservative behaviors, lack proactive interaction, and consequently suffer from limited public acceptance. To mitigate intent misunderstandings and decision failures, we present a Large Language Model based interactive decision-making framework that augments scene understanding and intent-aware interaction to jointly improve safety and efficiency. The approach uses Object-Process Methodology to semantically model complex multi-vehicle scenes, abstracting low-level perceptual data into objects, processes, and relations, thereby streamlining reasoning over latent causal structure. Building on this representation, the Large Language Model parses both explicit and implicit intents of surrounding agents and, under jointly enforced safety and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
