EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language Guidance
Jiahui Wang, Haiyue Zhu, Haoren Guo, Abdullah Al Mamun, Cheng Xiang, Tong Heng Lee

TL;DR
EPSegFZ introduces a pre-training-free, language-guided point cloud segmentation model that enhances few- and zero-shot performance by integrating novel attention modules and textual support information.
Contribution
The paper proposes a novel pre-training-free network with modules that leverage textual data, improving few- and zero-shot 3D point cloud segmentation performance.
Findings
Outperforms state-of-the-art by 5.68% on S3DIS
Outperforms state-of-the-art by 3.82% on ScanNet
Effective use of textual support data enhances zero-shot capabilities
Abstract
Recent approaches for few-shot 3D point cloud semantic segmentation typically require a two-stage learning process, i.e., a pre-training stage followed by a few-shot training stage. While effective, these methods face overreliance on pre-training, which hinders model flexibility and adaptability. Some models tried to avoid pre-training yet failed to capture ample information. In addition, current approaches focus on visual information in the support set and neglect or do not fully exploit other useful data, such as textual annotations. This inadequate utilization of support information impairs the performance of the model and restricts its zero-shot ability. To address these limitations, we present a novel pre-training-free network, named Efficient Point Cloud Semantic Segmentation for Few- and Zero-shot scenarios. Our EPSegFZ incorporates three key components. A Prototype-Enhanced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
Topics3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization · Robot Manipulation and Learning
