3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions

Dale Decatur; Itai Lang; Rana Hanocka

arXiv:2212.11263·cs.GR·December 22, 2022

3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions

Dale Decatur, Itai Lang, Rana Hanocka

PDF

Open Access 1 Repo

TL;DR

3D Highlighter is a flexible system that localizes semantic regions on 3D shapes based on text descriptions, including out-of-domain concepts, using a neural field guided by CLIP without requiring 3D datasets.

Contribution

It introduces a novel text-guided 3D shape localization method that handles out-of-domain concepts without 3D annotations, leveraging neural fields and CLIP.

Findings

01

Successfully localizes semantic regions on diverse 3D shapes.

02

Handles out-of-domain and non-obvious concepts effectively.

03

Operates without 3D datasets or annotations.

Abstract

We present 3D Highlighter, a technique for localizing semantic regions on a mesh using text as input. A key feature of our system is the ability to interpret "out-of-domain" localizations. Our system demonstrates the ability to reason about where to place non-obviously related concepts on an input 3D shape, such as adding clothing to a bare 3D animal model. Our method contextualizes the text description using a neural field and colors the corresponding region of the shape using a probability-weighted blend. Our neural optimization is guided by a pre-trained CLIP encoder, which bypasses the need for any 3D datasets or 3D annotations. Thus, 3D Highlighter is highly flexible, general, and capable of producing localizations on a myriad of input shapes. Our code is publicly available at https://github.com/threedle/3DHighlighter.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

threedle/3DHighlighter
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Image Processing and 3D Reconstruction · Human Motion and Animation

MethodsContrastive Language-Image Pre-training