Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024
Jiamin Cao, Lingqi Wang, Kexin Zhang, Yuting Yang, Licheng Jiao, Yuwei, Guo

TL;DR
This paper presents ROAD++ Track3's approach to multi-label atomic activity recognition in traffic videos, emphasizing robust visual features and advanced attention mechanisms to improve accuracy and generalization.
Contribution
The paper introduces optimized data processing, diverse visual backbones, an action-slot model, and weighted model fusion to enhance multi-label activity recognition performance.
Findings
Achieved a 58% mAP on the test set, outperforming the baseline by 4%.
Implemented a multi-faceted optimization strategy for robustness.
Demonstrated effectiveness of combining multiple models for better accuracy.
Abstract
Road++ Track3 proposes a multi-label atomic activity recognition task in traffic scenarios, which can be standardized as a 64-class multi-label video action recognition task. In the multi-label atomic activity recognition task, the robustness of visual feature extraction remains a key challenge, which directly affects the model performance and generalization ability. To cope with these issues, our team optimized three aspects: data processing, model and post-processing. Firstly, the appropriate resolution and video sampling strategy are selected, and a fixed sampling strategy is set on the validation and test sets. Secondly, in terms of model training, the team selects a variety of visual backbone networks for feature extraction, and then introduces the action-slot model, which is trained on the training and validation sets, and reasoned on the test set. Finally, for post-processing,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · EEG and Brain-Computer Interfaces · Machine Learning in Materials Science
MethodsSparse Evolutionary Training
