View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields
Haodi He, Colton Stearns, Adam W. Harley, Leonidas J. Guibas

TL;DR
This paper introduces a method to convert view-inconsistent multi-granular 2D segmentations into a hierarchical, 3D-consistent representation using an ultrametric feature space within a Neural Radiance Field, improving accuracy and consistency.
Contribution
The work presents a novel ultrametric feature space within NeRF for hierarchical 3D segmentation, addressing view inconsistency and multi-granularity challenges.
Findings
Improved accuracy over baseline methods.
Enhanced viewpoint consistency in 3D segmentation.
Qualitative success in real-world scene segmentation.
Abstract
Large-scale vision foundation models such as Segment Anything (SAM) demonstrate impressive performance in zero-shot image segmentation at multiple levels of granularity. However, these zero-shot predictions are rarely 3D-consistent. As the camera viewpoint changes in a scene, so do the segmentation predictions, as well as the characterizations of "coarse" or "fine" granularity. In this work, we address the challenging task of lifting multi-granular and view-inconsistent image segmentations into a hierarchical and 3D-consistent representation. We learn a novel feature field within a Neural Radiance Field (NeRF) representing a 3D scene, whose segmentation structure can be revealed at different scales by simply using different thresholds on feature distance. Our key idea is to learn an ultrametric feature space, which unlike a Euclidean space, exhibits transitivity in distance-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · Image Processing and 3D Reconstruction
