Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights   from the COOOL Challenge

Anh-Kiet Duong; Petra Gomez-Kr\"amer

arXiv:2501.16037·cs.CV·February 11, 2025

Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge

Anh-Kiet Duong, Petra Gomez-Kr\"amer

PDF

Open Access 1 Repo

TL;DR

This paper introduces a comprehensive hazard detection system for dashcam videos that combines anomaly detection, ensemble weak classifiers with privacy measures, and vision-language models for captioning, achieving top results in the COOOL challenge.

Contribution

It presents a novel integrated approach for hazard detection and captioning in dashcam footage, utilizing unsupervised learning, ensemble methods, differential privacy, and advanced vision-language models.

Findings

01

Achieved highest scores in the COOOL challenge for hazard detection and captioning.

02

Demonstrated robustness of the ensemble method with privacy enhancements.

03

Effectively detected driver reactions and hazardous objects in dashcam videos.

Abstract

This paper presents a novel approach for hazard analysis in dashcam footage, addressing the detection of driver reactions to hazards, the identification of hazardous objects, and the generation of descriptive captions. We first introduce a method for detecting driver reactions through speed and sound anomaly detection, leveraging unsupervised learning techniques. For hazard detection, we employ a set of heuristic rules as weak classifiers, which are combined using an ensemble method. This ensemble approach is further refined with differential privacy to mitigate overconfidence, ensuring robustness despite the lack of labeled data. Lastly, we use state-of-the-art vision-language models for hazard captioning, generating descriptive labels for the detected hazards. Our method achieved the highest scores in the Challenge on Out-of-Label in Autonomous Driving, demonstrating its effectiveness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ffyyytt/coool_2025
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sparse Evolutionary Training