EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language Guidance

Jiahui Wang; Haiyue Zhu; Haoren Guo; Abdullah Al Mamun; Cheng Xiang; Tong Heng Lee

arXiv:2511.11700·cs.CV·November 18, 2025

EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language Guidance

Jiahui Wang, Haiyue Zhu, Haoren Guo, Abdullah Al Mamun, Cheng Xiang, Tong Heng Lee

PDF

Open Access 1 Video

TL;DR

EPSegFZ introduces a pre-training-free, language-guided point cloud segmentation model that enhances few- and zero-shot performance by integrating novel attention modules and textual support information.

Contribution

The paper proposes a novel pre-training-free network with modules that leverage textual data, improving few- and zero-shot 3D point cloud segmentation performance.

Findings

01

Outperforms state-of-the-art by 5.68% on S3DIS

02

Outperforms state-of-the-art by 3.82% on ScanNet

03

Effective use of textual support data enhances zero-shot capabilities

Abstract

Recent approaches for few-shot 3D point cloud semantic segmentation typically require a two-stage learning process, i.e., a pre-training stage followed by a few-shot training stage. While effective, these methods face overreliance on pre-training, which hinders model flexibility and adaptability. Some models tried to avoid pre-training yet failed to capture ample information. In addition, current approaches focus on visual information in the support set and neglect or do not fully exploit other useful data, such as textual annotations. This inadequate utilization of support information impairs the performance of the model and restricts its zero-shot ability. To address these limitations, we present a novel pre-training-free network, named Efficient Point Cloud Semantic Segmentation for Few- and Zero-shot scenarios. Our EPSegFZ incorporates three key components. A Prototype-Enhanced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language Guidance· underline

Taxonomy

Topics3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization · Robot Manipulation and Learning