SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation

Ziyi Wang; Yongming Rao; Xumin Yu; Jie Zhou; Jiwen Lu

arXiv:2205.13490·cs.CV·May 27, 2022

SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation

Ziyi Wang, Yongming Rao, Xumin Yu, Jie Zhou, Jiwen Lu

PDF

Open Access 1 Repo

TL;DR

SemAffiNet introduces a semantic-affine transformation technique that enhances mid-level features with class-specific information, improving point cloud segmentation accuracy by capturing global context and reducing confusion between similar local parts.

Contribution

The paper proposes a novel semantic-affine transformation method and integrates it into a Transformer-based network for improved 3D point cloud segmentation.

Findings

01

Outperforms existing methods on ScanNetV2 and NYUv2 datasets.

02

Demonstrates superior generalization across different segmentation baselines.

03

Qualitative results show clearer category boundaries and reduced confusion.

Abstract

Conventional point cloud semantic segmentation methods usually employ an encoder-decoder architecture, where mid-level features are locally aggregated to extract geometric information. However, the over-reliance on these class-agnostic local geometric representations may raise confusion between local parts from different categories that are similar in appearance or spatially adjacent. To address this issue, we argue that mid-level features can be further enhanced with semantic information, and propose semantic-affine transformation that transforms features of mid-level points belonging to different categories with class-specific affine parameters. Based on this technique, we propose SemAffiNet for point cloud semantic segmentation, which utilizes the attention mechanism in the Transformer module to implicitly and explicitly capture global structural knowledge within local parts for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wangzy22/SemAffiNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Advanced Neural Network Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Layer Normalization · Byte Pair Encoding · Dense Connections · Absolute Position Encodings · Dropout · Adam