GARField: Group Anything with Radiance Fields

Chung Min Kim; Mingxuan Wu; Justin Kerr; Ken Goldberg; Matthew Tancik,; Angjoo Kanazawa

arXiv:2401.09419·cs.CV·January 18, 2024·2 cites

GARField: Group Anything with Radiance Fields

Chung Min Kim, Mingxuan Wu, Justin Kerr, Ken Goldberg, Matthew Tancik,, Angjoo Kanazawa

PDF

Open Access 1 Repo

TL;DR

GARField is a novel method that decomposes 3D scenes into hierarchical, semantically meaningful groups by leveraging scale-conditioned affinity fields, enabling multi-level scene understanding from posed images.

Contribution

It introduces a scale-conditioned affinity field approach that captures multi-level scene groupings and respects ambiguity, improving 3D scene decomposition from 2D masks.

Findings

01

Effectively extracts multi-level scene groups including objects and parts

02

Produces higher fidelity groups than input masks from SAM

03

Ensures multi-view consistency in groupings

Abstract

Grouping is inherently ambiguous due to the multiple levels of granularity in which one can decompose a scene -- should the wheels of an excavator be considered separate or part of the whole? We present Group Anything with Radiance Fields (GARField), an approach for decomposing 3D scenes into a hierarchy of semantically meaningful groups from posed image inputs. To do this we embrace group ambiguity through physical scale: by optimizing a scale-conditioned 3D affinity feature field, a point in the world can belong to different groups of different sizes. We optimize this field from a set of 2D masks provided by Segment Anything (SAM) in a way that respects coarse-to-fine hierarchy, using scale to consistently fuse conflicting masks from different viewpoints. From this field we can derive a hierarchy of possible groupings via automatic tree construction or user interaction. We evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chungmin99/garfield
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Advanced Vision and Imaging · Robotics and Sensor-Based Localization

MethodsSparse Evolutionary Training · Segment Anything Model