NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields
Lin Yen-Chen, Pete Florence, Jonathan T. Barron, Tsung-Yi Lin, Alberto, Rodriguez, Phillip Isola

TL;DR
This paper introduces a novel method using Neural Radiance Fields (NeRF) to generate dense object descriptors for challenging thin and reflective objects, significantly improving robotic perception and manipulation capabilities.
Contribution
The authors propose a new NeRF-based supervision technique for learning view-invariant dense object descriptors, outperforming existing methods in accuracy and enabling precise robot manipulation.
Findings
Dense descriptors outperform off-the-shelf methods by 106%.
NeRF-based supervision improves descriptor accuracy over multi-view stereo baseline.
Robots successfully perform 6-DoF pick and place of challenging objects.
Abstract
Thin, reflective objects such as forks and whisks are common in our daily lives, but they are particularly challenging for robot perception because it is hard to reconstruct them using commodity RGB-D cameras or multi-view stereo techniques. While traditional pipelines struggle with objects like these, Neural Radiance Fields (NeRFs) have recently been shown to be remarkably effective for performing view synthesis on objects with thin structures or reflective materials. In this paper we explore the use of NeRF as a new source of supervision for robust robot vision systems. In particular, we demonstrate that a NeRF representation of a scene can be used to train dense object descriptors. We use an optimized NeRF to extract dense correspondences between multiple views of an object, and then use these correspondences as training data for learning a view-invariant representation of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Image Processing Techniques and Applications
