Enhancing Traffic Object Detection in Variable Illumination with   RGB-Event Fusion

Zhanwen Liu; Nan Yang; Yang Wang; Yuke Li; Xiangmo Zhao; Fei-Yue Wang

arXiv:2311.00436·cs.CV·September 17, 2024·1 cites

Enhancing Traffic Object Detection in Variable Illumination with RGB-Event Fusion

Zhanwen Liu, Nan Yang, Yang Wang, Yuke Li, Xiangmo Zhao, Fei-Yue Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel fusion network leveraging RGB and event camera data to improve traffic object detection under variable illumination, achieving significant performance gains over traditional frame-based methods.

Contribution

The paper proposes SFNet, a structure-aware fusion network with RSGNet and AFCM modules, and introduces the DSEC-Det dataset for enhanced event-based traffic object detection.

Findings

01

Outperforms frame-based methods by 8.0% in mAP50

02

Generates illumination-robust representations

03

Provides a new dataset with 63,931 images and 208,000 labels

Abstract

Traffic object detection under variable illumination is challenging due to the information loss caused by the limited dynamic range of conventional frame-based cameras. To address this issue, we introduce bio-inspired event cameras and propose a novel Structure-aware Fusion Network (SFNet) that extracts sharp and complete object structures from the event stream to compensate for the lost information in images through cross-modality fusion, enabling the network to obtain illumination-robust representations for traffic object detection. Specifically, to mitigate the sparsity or blurriness issues arising from diverse motion states of traffic objects in fixed-interval event sampling methods, we propose the Reliable Structure Generation Network (RSGNet) to generate Speed Invariant Frames (SIF), ensuring the integrity and sharpness of object structures. Next, we design a novel Adaptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yn-yang/sfnet
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Brain Tumor Detection and Classification

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings