LaTeRF: Label and Text Driven Object Radiance Fields
Ashkan Mirzaei, Yash Kant, Jonathan Kelly, and Igor Gilitschenski

TL;DR
LaTeRF is a novel method that extracts 3D object representations from scenes using weak supervision, natural language, and point-labels, leveraging CLIP and NeRF extensions for high-fidelity results.
Contribution
It introduces a new approach combining language, point-labels, and NeRF extensions to improve 3D object extraction from scenes with minimal supervision.
Findings
High-fidelity object extraction demonstrated on synthetic datasets.
Effective occlusion inpainting using CLIP's latent space.
Extensive ablation studies validate design choices.
Abstract
Obtaining 3D object representations is important for creating photo-realistic simulations and for collecting AR and VR assets. Neural fields have shown their effectiveness in learning a continuous volumetric representation of a scene from 2D images, but acquiring object representations from these models with weak supervision remains an open challenge. In this paper we introduce LaTeRF, a method for extracting an object of interest from a scene given 2D images of the entire scene, known camera poses, a natural language description of the object, and a set of point-labels of object and non-object points in the input images. To faithfully extract the object from the scene, LaTeRF extends the NeRF formulation with an additional `objectness' probability at each 3D point. Additionally, we leverage the rich latent space of a pre-trained CLIP model combined with our differentiable object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Advanced Neural Network Applications · 3D Surveying and Cultural Heritage
MethodsContrastive Language-Image Pre-training
