Similarity-Aware Fusion Network for 3D Semantic Segmentation
Linqing Zhao, Jiwen Lu, Jie Zhou

TL;DR
SAFNet introduces a similarity-aware fusion approach that adaptively combines 2D images and 3D point clouds for improved 3D semantic segmentation, overcoming fixed fusion limitations and enhancing performance on real-world data.
Contribution
The paper presents a novel similarity-aware fusion network with geometric and contextual modules for adaptive multi-modal data integration in 3D segmentation.
Findings
Outperforms state-of-the-art fusion methods on ScanNetV2.
Effectively measures modality contribution for each point.
Robust to data with missing or unpaired modalities.
Abstract
In this paper, we propose a similarity-aware fusion network (SAFNet) to adaptively fuse 2D images and 3D point clouds for 3D semantic segmentation. Existing fusion-based methods achieve remarkable performances by integrating information from multiple modalities. However, they heavily rely on the correspondence between 2D pixels and 3D points by projection and can only perform the information fusion in a fixed manner, and thus their performances cannot be easily migrated to a more realistic scenario where the collected data often lack strict pair-wise features for prediction. To address this, we employ a late fusion strategy where we first learn the geometric and contextual similarities between the input and back-projected (from 2D pixels) point clouds and utilize them to guide the fusion of two modalities to further exploit complementary information. Specifically, we employ a geometric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging and Analysis · 3D Shape Modeling and Analysis
