TL;DR
This paper introduces RENet, a novel RGB-Event fusion network that leverages asynchronous event data and RGB images with multi-scale temporal aggregation and bi-directional feature fusion to improve moving object detection in autonomous driving.
Contribution
The paper presents a new RGB-Event fusion architecture with multi-scale temporal aggregation and bi-directional feature fusion for robust moving object detection.
Findings
RENet outperforms state-of-the-art RGB-Event methods.
The proposed modules effectively utilize asynchronous event data.
The dataset annotation improves evaluation accuracy.
Abstract
Moving Object Detection (MOD) is a critical vision task for successfully achieving safe autonomous driving. Despite plausible results of deep learning methods, most existing approaches are only frame-based and may fail to reach reasonable performance when dealing with dynamic traffic participants. Recent advances in sensor technologies, especially the Event camera, can naturally complement the conventional camera approach to better model moving objects. However, event-based works often adopt a pre-defined time window for event representation, and simply integrate it to estimate image intensities from events, neglecting much of the rich temporal information from the available asynchronous events. Therefore, from a new perspective, we propose RENet, a novel RGB-Event fusion Network, that jointly exploits the two complementary modalities to achieve more robust MOD under challenging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
