NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields

Eric Zhu; Mara Levy; Matthew Gwilliam; Abhinav Shrivastava

arXiv:2411.02482·cs.RO·September 16, 2025

NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields

Eric Zhu, Mara Levy, Matthew Gwilliam, Abhinav Shrivastava

PDF

Open Access

TL;DR

NeRF-Aug uses neural radiance fields to generate photorealistic, fast, and consistent augmented data for robotic policy training, significantly improving generalization to unseen objects.

Contribution

The paper introduces NeRF-Aug, a novel data augmentation method leveraging neural radiance fields for improved robotic policy generalization to unseen objects.

Findings

01

NeRF-Aug creates more photorealistic augmented data.

02

NeRF-Aug runs 63% faster than existing methods.

03

Achieves 55.6% average performance boost on unseen objects.

Abstract

Training a policy that can generalize to unknown objects is a long standing challenge within the field of robotics. The performance of a policy often drops significantly in situations where an object in the scene was not seen during training. To solve this problem, we present NeRF-Aug, a novel method that is capable of teaching a policy to interact with objects that are not present in the dataset. This approach differs from existing approaches by leveraging the speed, photorealism, and 3D consistency of a neural radiance field for augmentation. NeRF-Aug both creates more photorealistic data and runs 63% faster than existing methods. We demonstrate the effectiveness of our method on 5 tasks with 9 novel objects that are not present in the expert demonstrations. We achieve an average performance boost of 55.6% when comparing our method to the next best method. You can see video results at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings