GraphPilot: Grounded Scene Graph Conditioning for Language-Based Autonomous Driving
Fabian Schmidt, Markus Enzweiler, Abhinav Valada

TL;DR
GraphPilot introduces a novel method for autonomous driving that conditions language models on structured scene graphs, significantly improving reasoning about spatial and relational dynamics without needing scene graph input during inference.
Contribution
The paper presents a model-agnostic approach to incorporate scene graph conditioning into language-based driving models, enhancing relational reasoning capabilities.
Findings
Substantial performance improvements on LangAuto and Bench2Drive benchmarks.
Effective internalization of relational priors without scene graph input at test time.
Scene graph conditioning enhances spatial and relational understanding in autonomous driving models.
Abstract
Vision-language models have recently emerged as promising planners for autonomous driving, where success hinges on topology-aware reasoning over spatial structure and dynamic interactions from multimodal input. However, existing models are typically trained without supervision that explicitly encodes these relational dependencies, limiting their ability to infer how agents and other traffic entities influence one another from raw sensor data. In this work, we bridge this gap with a novel model-agnostic method that conditions language-based driving models on structured relational context in the form of traffic scene graphs. We serialize scene graphs at various abstraction levels and formats, and incorporate them into models via structured prompt templates, enabling systematic analysis of when and how relational supervision is most beneficial and computationally efficient. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Graph Neural Networks · Autonomous Vehicle Technology and Safety
