Reasoning About Traversability: Language-Guided Off-Road 3D Trajectory Planning
Byounggun Park, Soonmin Hwang

TL;DR
This paper introduces a novel framework for off-road trajectory planning that uses language refinement and terrain-aware optimization to improve autonomous driving in unstructured environments.
Contribution
It presents a new language refinement method and terrain-aware preference optimization to enhance scene understanding and trajectory accuracy in off-road driving scenarios.
Findings
Trajectory error reduced from 1.01m to 0.97m
Traversability compliance improved from 0.621 to 0.644
Elevation inconsistency decreased from 0.428 to 0.322
Abstract
While Vision-Language Models (VLMs) enable high-level semantic reasoning for end-to-end autonomous driving, particularly in unstructured environments, existing off-road datasets suffer from language annotations that are weakly aligned with vehicle actions and terrain geometry. To address this misalignment, we propose a language refinement framework that restructures annotations into action-aligned pairs, enabling a VLM to generate refined scene descriptions and 3D future trajectories directly from a single image. To further encourage terrain-aware planning, we introduce a preference optimization strategy that constructs geometry-aware hard negatives and explicitly penalizes trajectories inconsistent with local elevation profiles. Furthermore, we propose off-road-specific metrics to quantify traversability compliance and elevation consistency, addressing the limitations of conventional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
