Beyond conventional vision: RGB-event fusion for robust object detection in dynamic traffic scenarios

Zhanwen Liu; Yujing Sun; Yang Wang; Nan Yang; Shengbo Eben Li; Xiangmo Zhao

arXiv:2508.10704·cs.CV·August 15, 2025

Beyond conventional vision: RGB-event fusion for robust object detection in dynamic traffic scenarios

Zhanwen Liu, Yujing Sun, Yang Wang, Nan Yang, Shengbo Eben Li, Xiangmo Zhao

PDF

TL;DR

This paper introduces MCFNet, a novel RGB-event fusion network that enhances object detection in challenging traffic scenarios by integrating high dynamic range event data with RGB images through advanced alignment and fusion modules.

Contribution

The paper proposes a new motion cue fusion network (MCFNet) with modules for event correction, dynamic upsampling, and adaptive cross-modal fusion, improving detection robustness in poor lighting and fast motion conditions.

Findings

01

MCFNet outperforms existing methods on DSEC-Det and PKU-DAVIS-SOD datasets.

02

Achieves 7.4% higher mAP50 on DSEC-Det.

03

Demonstrates robustness in nighttime and tunnel scenarios.

Abstract

The dynamic range limitation of conventional RGB cameras reduces global contrast and causes loss of high-frequency details such as textures and edges in complex traffic environments (e.g., nighttime driving, tunnels), hindering discriminative feature extraction and degrading frame-based object detection. To address this, we integrate a bio-inspired event camera with an RGB camera to provide high dynamic range information and propose a motion cue fusion network (MCFNet), which achieves optimal spatiotemporal alignment and adaptive cross-modal feature fusion under challenging lighting. Specifically, an event correction module (ECM) temporally aligns asynchronous event streams with image frames via optical-flow-based warping, jointly optimized with the detection network to learn task-aware event representations. The event dynamic upsampling module (EDUM) enhances spatial resolution of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.