First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024
Tengfei Zhang, Heng Zhang, Ruyang Li, Qi Deng, Yaqian Zhao, Rengang Li

TL;DR
This paper presents a comprehensive solution for spatiotemporal agent detection in videos, addressing challenges like object size, low-light conditions, and class imbalance, achieving first place in the ECCV 2024 ROAD++ Challenge.
Contribution
The authors introduce novel detection heads, a dual-stream model with low-light enhancement, and a multi-branch framework with optimized training strategies for improved agent detection.
Findings
Achieved 30.82% average video-mAP on the test set.
Ranked first in the ECCV 2024 ROAD++ Challenge.
Enhanced detection of extreme-size objects and low-light scenes.
Abstract
This report presents our team's solutions for the Track 1 of the 2024 ECCV ROAD++ Challenge. The task of Track 1 is spatiotemporal agent detection, which aims to construct an "agent tube" for road agents in consecutive video frames. Our solutions focus on the challenges in this task, including extreme-size objects, low-light scenarios, class imbalance, and fine-grained classification. Firstly, the extreme-size object detection heads are introduced to improve the detection performance of large and small objects. Secondly, we design a dual-stream detection model with a low-light enhancement stream to improve the performance of spatiotemporal agent detection in low-light scenes, and the feature fusion module to integrate features from different branches. Subsequently, we develop a multi-branch detection framework to mitigate the issues of class imbalance and fine-grained classification,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications
MethodsSparse Evolutionary Training · Focus
