AutoNeRF: Training Implicit Scene Representations with Autonomous Agents

Pierre Marza; Laetitia Matignon; Olivier Simonin; Dhruv Batra,; Christian Wolf; Devendra Singh Chaplot

arXiv:2304.11241·cs.CV·December 25, 2023·5 cites

AutoNeRF: Training Implicit Scene Representations with Autonomous Agents

Pierre Marza, Laetitia Matignon, Olivier Simonin, Dhruv Batra,, Christian Wolf, Devendra Singh Chaplot

PDF

Open Access 1 Repo 3 Reviews

TL;DR

AutoNeRF enables autonomous agents to collect data and train neural radiance fields in unseen environments, facilitating efficient scene understanding and downstream robotic tasks without manual data collection.

Contribution

This work introduces AutoNeRF, a novel autonomous data collection method for training NeRFs, reducing manual effort and enabling scene-specific adaptation for robotic applications.

Findings

01

NeRFs can be trained with a single exploration episode in unseen environments.

02

Modular exploration models outperform classical and end-to-end baselines.

03

AutoNeRF effectively reconstructs large-scale scenes for scene-specific adaptation.

Abstract

Implicit representations such as Neural Radiance Fields (NeRF) have been shown to be very effective at novel view synthesis. However, these models typically require manual and careful human data collection for training. In this paper, we present AutoNeRF, a method to collect data required to train NeRFs using autonomous embodied agents. Our method allows an agent to explore an unseen environment efficiently and use the experience to build an implicit map representation autonomously. We compare the impact of different exploration strategies including handcrafted frontier-based exploration, end-to-end and modular approaches composed of trained high-level planners and classical low-level path followers. We train these models with different reward functions tailored to this problem and evaluate the quality of the learned representations on four different downstream tasks: classical…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 3· reject, not good enoughConfidence 2

Strengths

1. The proposed autonomous approach for embodied agents collecting NeRF visual training data greatly reduces human intervention. 2. The authors conduct a comprehensive evaluation assessing the quality of the reconstructed scene produced by four different autonomous data collecting approaches. The assessment is carried out by evaluating each approach based on four downstream tasks which indicates the actual performance on follow-up applications.

Weaknesses

1. The scientific contribution of this paper is unclear. Although the authors conduct a comprehensive evaluation of the designed autonomous approaches, it does not point out their core contributions to this field. The proposed autonomous exploration strategy is straight-forward, and could not be considered as main contribution of this paper. Additionally, the absence of a comparison between their methods and existing techniques in autonomous visual exploration—which could have been utilized for

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 3

Strengths

The authors correctly emphasize the need for more realistic tasks when evaluating NeRF in navigation scenarios. Furthermore, they have introduced effective rewards that yield improved NeRF results in the various suggested tasks.

Weaknesses

As the authors attempt to cover a wide range of tasks, the presentation of results lacks organization, making it challenging for readers to interpret and examine the outcomes. Further weaknesses are outlined in the "Questions" section.

Reviewer 03Rating 6· marginally above the acceptance thresholdConfidence 3

Strengths

1. The paper explores multiple exploration policies for collecting training samples for a scene NERF, which provides a comprehensive analysis of the community. 2. The policies are further evaluated using different downstream robotic tasks, which is beneficial to related researchers. 3. The experimental results are comprehensive and solid. 4. The paper is well-written and easy to follow.

Weaknesses

1. The novelty of the paper is limited. The idea of using Nerf for scene construction in SLAM is not new. The authors also adopted an off-the-shelf NERF module. The main contribution of this paper lies in validating the effect of different exploration policies during image collection on downstream robotics tasks.

Code & Models

Repositories

PierreMarza/autonerf
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Advanced Neural Network Applications