An Attention-guided Multistream Feature Fusion Network for Localization   of Risky Objects in Driving Videos

Muhammad Monjurul Karim; Ruwen Qin; Zhaozheng Yin

arXiv:2209.07922·cs.CV·September 19, 2022·1 cites

An Attention-guided Multistream Feature Fusion Network for Localization of Risky Objects in Driving Videos

Muhammad Monjurul Karim, Ruwen Qin, Zhaozheng Yin

PDF

Open Access 1 Repo

TL;DR

This paper introduces AM-Net, an attention-guided multistream network that effectively localizes risky traffic objects in dashcam videos, improving safety by identifying dangerous agents through spatio-temporal cues and an attention mechanism.

Contribution

The paper presents a novel AM-Net architecture combining GRUs and attention modules for risky object localization, along with a new benchmark dataset for this task.

Findings

01

AM-Net achieves 85.73% AUC on the ROL dataset.

02

Outperforms state-of-the-art by 6.3% AUC on DoTA.

03

Ablation study confirms effectiveness of components.

Abstract

Detecting dangerous traffic agents in videos captured by vehicle-mounted dashboard cameras (dashcams) is essential to facilitate safe navigation in a complex environment. Accident-related videos are just a minor portion of the driving video big data, and the transient pre-accident processes are highly dynamic and complex. Besides, risky and non-risky traffic agents can be similar in their appearance. These make risky object localization in the driving video particularly challenging. To this end, this paper proposes an attention-guided multistream feature fusion network (AM-Net) to localize dangerous traffic agents from dashcam videos. Two Gated Recurrent Unit (GRU) networks use object bounding box and optical flow features extracted from consecutive video frames to capture spatio-temporal cues for distinguishing dangerous traffic agents. An attention module coupled with the GRUs learns…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

monjurulkarim/rol_dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods · Human Pose and Action Recognition