RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation
Zhexiong Wan, Yuxin Mao, Jing Zhang, Yuchao Dai

TL;DR
RPEFlow is a multimodal fusion model that combines RGB images, point clouds, and event data for joint optical flow and scene flow estimation. It targets highly dynamic scenes in particular and is reported to outperform existing methods.
Contribution
The paper introduces a novel multi-stage multimodal fusion model with attention mechanisms and mutual information regularization for enhanced flow estimation.
Findings
Outperforms state-of-the-art methods on synthetic and real datasets.
Effectively leverages high-temporal-resolution event data.
Provides a new synthetic dataset for future research.
Abstract
Recently, methods that fuse RGB images and point clouds have been proposed to jointly estimate 2D optical flow and 3D scene flow. However, as both conventional RGB cameras and LiDAR sensors adopt a frame-based data acquisition mechanism, their performance is limited by fixed low sampling rates, especially in highly dynamic scenes. By contrast, the event camera asynchronously captures intensity changes with very high temporal resolution, providing complementary dynamic information about the observed scene. In this paper, we incorporate RGB images, Point clouds and Events for joint optical flow and scene flow estimation with our proposed multi-stage multimodal fusion model, RPEFlow. First, we present an attention fusion module with a cross-attention mechanism to implicitly explore the internal cross-modal correlation for the 2D and 3D branches, respectively. Second, we introduce a…
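To make the cross-attention idea in the abstract concrete, here is a minimal, dependency-free sketch of scaled dot-product cross-attention between two sets of feature vectors. It is not the RPEFlow implementation: the function names `cross_attention` and `fuse`, the absence of learned projections, and the residual addition are all assumptions made for illustration only.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention (hypothetical helper).

    queries: N x d features from the primary modality (e.g. RGB).
    keys:    M x d features from the auxiliary modality (e.g. events).
    values:  M x dv features from the auxiliary modality.
    Returns N x dv attended features.
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    dv = len(values[0])
    out = []
    for q in queries:
        # Similarity of this query to every auxiliary token.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights = softmax(scores)
        # Weighted sum of auxiliary values.
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(dv)])
    return out


def fuse(primary, auxiliary):
    """Assumed residual fusion: primary features plus attended auxiliary ones."""
    attended = cross_attention(primary, auxiliary, auxiliary)
    return [[p + a for p, a in zip(p_row, a_row)]
            for p_row, a_row in zip(primary, attended)]


if __name__ == "__main__":
    rgb_feats = [[1.0, 0.0], [0.0, 1.0]]    # toy "primary" tokens
    event_feats = [[2.0, 2.0], [0.0, 4.0]]  # toy "auxiliary" tokens
    for row in fuse(rgb_feats, event_feats):
        print(row)
```

Each output row is the primary feature plus a convex combination of the auxiliary features, so the primary modality decides which auxiliary tokens matter for it. A real model would add learned query, key, and value projections and operate on dense feature maps or point features rather than toy lists.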
Taxonomy
Topics: Advanced Vision and Imaging · Advanced Optical Sensing Technologies · Image Enhancement Techniques
