TL;DR
This paper investigates catastrophic forgetting in vision-language models for autonomous driving, introduces a new dataset and benchmark, and proposes the Drive Expert Adapter to mitigate knowledge loss while maintaining performance.
Contribution
It is the first systematic study of forgetting in driving models, introducing a novel prompt-based adaptation framework that preserves pre-trained knowledge.
Findings
Existing methods cause significant knowledge degradation.
The Drive Expert Adapter improves driving task performance.
DEA effectively mitigates catastrophic forgetting.
Abstract
The integration of Vision-Language Models (VLMs) into autonomous driving promises to solve long-tail scenarios, but this paradigm faces the critical and unaddressed challenge of catastrophic forgetting. The very fine-tuning process used to adapt these models to driving-specific data simultaneously erodes their invaluable pre-trained world knowledge, creating a self-defeating paradox that undermines the core reason for their use. This paper provides the first systematic investigation into this phenomenon. We introduce a new large-scale dataset of 180K scenes, which enables the first-ever benchmark specifically designed to quantify catastrophic forgetting in autonomous driving. Our analysis reveals that existing methods suffer from significant knowledge degradation. To address this, we propose the Drive Expert Adapter (DEA), a novel framework that circumvents this trade-off by shifting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
