Bi-Directional Attention for Joint Instance and Semantic Segmentation in Point Clouds
Guangnan Wu, Zhiyi Pan, Peng Jiang, Changhe Tu

TL;DR
This paper introduces a Bi-Directional Attention module for 3D point cloud perception that enhances joint instance and semantic segmentation by facilitating mutual feature promotion and avoiding task conflicts.
Contribution
It proposes a novel bi-directional attention mechanism that improves multi-task learning in 3D point cloud segmentation tasks, surpassing simple feature fusion strategies.
Findings
Outperforms existing methods on S3DIS and PartNet datasets.
Demonstrates the effectiveness of bi-directional attention in mutual task enhancement.
Provides analysis of how the attention mechanism benefits joint segmentation.
Abstract
Instance segmentation in point clouds is one of the most fine-grained ways to understand the 3D scene. Due to its close relationship to semantic segmentation, many works approach these two tasks simultaneously and leverage the benefits of multi-task learning. However, most of them only considered simple strategies such as element-wise feature fusion, which may not lead to mutual promotion. In this work, we build a Bi-Directional Attention module on backbone neural networks for 3D point cloud perception, which uses similarity matrix measured from features for one task to help aggregate non-local information for the other task, avoiding the potential feature exclusion and task conflict. From comprehensive experiments and ablation studies on the S3DIS dataset and the PartNet dataset, the superiority of our method is verified. Moreover, the mechanism of how bi-directional attention module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · 3D Surveying and Cultural Heritage
