Robustness Is a Function, Not a Number: A Factorized Comprehensive Study of OOD Robustness in Vision-Based Driving
Amir Mallak, Alaa Maalouf

TL;DR
This study decomposes environment variations in autonomous driving to analyze their impact on out-of-distribution robustness, revealing that vision transformer policies with foundation-model features significantly outperform traditional CNNs and FC networks across multiple challenging factors.
Contribution
The paper provides a comprehensive, factorized analysis of OOD robustness in vision-based driving, demonstrating the superiority of ViT policies with foundation-model features and offering actionable insights for robust policy design.
Findings
ViT policies outperform CNN/FC in OOD robustness.
Foundation-model features achieve state-of-the-art success with latency trade-offs.
Training on winter/snow enhances robustness to single-factor shifts.
Abstract
Out of distribution (OOD) robustness in autonomous driving is often reduced to a single number, hiding what breaks a policy. We decompose environments along five axes: scene (rural/urban), season, weather, time (day/night), and agent mix; and measure performance under controlled -factor perturbations (). Using closed loop control in VISTA, we benchmark FC, CNN, and ViT policies, train compact ViT heads on frozen foundation-model (FM) features, and vary ID support in scale, diversity, and temporal context. (1) ViT policies are markedly more OOD-robust than comparably sized CNN/FC, and FM features yield state-of-the-art success at a latency cost. (2) Naive temporal inputs (multi-frame) do not beat the best single-frame baseline. (3) The largest single factor drops are rural urban and day night ( each); actor swaps ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Human-Automation Interaction and Safety
