An Attention-guided Multistream Feature Fusion Network for Localization of Risky Objects in Driving Videos
Muhammad Monjurul Karim, Ruwen Qin, Zhaozheng Yin

TL;DR
This paper introduces AM-Net, an attention-guided multistream network that effectively localizes risky traffic objects in dashcam videos, improving safety by identifying dangerous agents through spatio-temporal cues and an attention mechanism.
Contribution
The paper presents a novel AM-Net architecture combining GRUs and attention modules for risky object localization, along with a new benchmark dataset for this task.
Findings
AM-Net achieves 85.73% AUC on the ROL dataset.
Outperforms state-of-the-art by 6.3% AUC on DoTA.
Ablation study confirms effectiveness of components.
Abstract
Detecting dangerous traffic agents in videos captured by vehicle-mounted dashboard cameras (dashcams) is essential to facilitate safe navigation in a complex environment. Accident-related videos are just a minor portion of the driving video big data, and the transient pre-accident processes are highly dynamic and complex. Besides, risky and non-risky traffic agents can be similar in their appearance. These make risky object localization in the driving video particularly challenging. To this end, this paper proposes an attention-guided multistream feature fusion network (AM-Net) to localize dangerous traffic agents from dashcam videos. Two Gated Recurrent Unit (GRU) networks use object bounding box and optical flow features extracted from consecutive video frames to capture spatio-temporal cues for distinguishing dangerous traffic agents. An attention module coupled with the GRUs learns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
