Beyond the Beep: Scalable Collision Anticipation and Real-Time Explainability with BADAS-2.0
Roni Goldshmidt, Hamish Scott, Lorenzo Niccolini, and Hernan Matzner

TL;DR
BADAS-2.0 advances collision anticipation by expanding datasets, enabling real-time edge deployment through knowledge distillation, and providing explainability with attention heatmaps and reasoning models.
Contribution
It introduces a scalable, accurate, and explainable collision anticipation system with a large dataset, edge-compatible models, and real-time interpretability features.
Findings
Expanded dataset from 40k to 178,500 videos with high-risk scenarios.
Achieved 7-12x speedup with near-parity accuracy in compact models.
Produced real-time attention heatmaps and textual explanations for predictions.
Abstract
We present BADAS-2.0, the second generation of our collision anticipation system, building on BADAS-1.0, which showed that fine-tuning V-JEPA2 on large-scale ego-centric dashcam data outperforms both academic baselines and production ADAS systems. BADAS-2.0 advances the state of the art along three axes. (i) Long-tail benchmark and accuracy: We introduce a 10-group long-tail benchmark targeting rare and safety-critical scenarios. To construct it, BADAS-1.0 is used as an active oracle to score millions of unlabeled drives and surface high-risk candidates for annotation. Combined with Nexar's Atlas platform for targeted data collection, this expands the dataset from 40k to 178,500 labeled videos (~2M clips), yielding consistent gains across all subgroups, with the largest improvements on the hardest long-tail cases. (ii) Knowledge distillation to edge: Domain-specific self-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
