DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF

Doriand Petit; Steve Bourgeois; Vincent Gay-Bellile; Florian Chabot; Lo\"ic Barthe

arXiv:2507.14596·cs.CV·July 22, 2025

DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF

Doriand Petit, Steve Bourgeois, Vincent Gay-Bellile, Florian Chabot, Lo\"ic Barthe

PDF

TL;DR

DiSCO-3D introduces a novel method for 3D semantic segmentation that combines unsupervised learning with open-vocabulary guidance, enabling discovery of sub-concepts in scenes based on user queries.

Contribution

It is the first approach to address 3D open-vocabulary sub-concept discovery by integrating neural fields with weak guidance and unsupervised segmentation.

Findings

01

Achieves effective open-vocabulary sub-concept discovery in 3D scenes.

02

Outperforms existing methods in open-vocabulary and unsupervised segmentation tasks.

03

Demonstrates state-of-the-art results in challenging edge cases.

Abstract

3D semantic segmentation provides high-level scene understanding for applications in robotics, autonomous systems, \textit{etc}. Traditional methods adapt exclusively to either task-specific goals (open-vocabulary segmentation) or scene content (unsupervised semantic segmentation). We propose DiSCO-3D, the first method addressing the broader problem of 3D Open-Vocabulary Sub-concepts Discovery, which aims to provide a 3D semantic segmentation that adapts to both the scene and user queries. We build DiSCO-3D on Neural Fields representations, combining unsupervised segmentation with weak open-vocabulary guidance. Our evaluations demonstrate that DiSCO-3D achieves effective performance in Open-Vocabulary Sub-concepts Discovery and exhibits state-of-the-art results in the edge cases of both open-vocabulary and unsupervised segmentation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.