Data Scaling for Navigation in Unknown Environments
Lauri Suomela, Naoki Takahata, Sasanka Kuruppu Arachchige, Harry Edelman, Joni-Kristian K\"am\"ar\"ainen

TL;DR
This study demonstrates that large-scale, diverse crowd-sourced data significantly improves the zero-shot generalization of visual navigation policies in unknown environments, with simple models outperforming complex ones.
Contribution
It is the first large-scale analysis showing data diversity outweighs quantity in training effective navigation policies for unseen environments.
Findings
Data diversity reduces navigation errors more than data quantity.
Doubling geographical locations decreases errors by ~15%.
Simple regression models outperform complex architectures with noisy data.
Abstract
Generalization of imitation-learned navigation policies to environments unseen in training remains a major challenge. We address this by conducting the first large-scale study of how data quantity and data diversity affect real-world generalization in end-to-end, map-free visual navigation. Using a curated 4,565-hour crowd-sourced dataset collected across 161 locations in 35 countries, we train policies for point goal navigation and evaluate their closed-loop control performance on sidewalk robots operating in four countries, covering 125 km of autonomous driving. Our results show that large-scale training data enables zero-shot navigation in unknown environments, approaching the performance of policies trained with environment-specific demonstrations. Critically, we find that data diversity is far more important than data quantity. Doubling the number of geographical locations in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗lauriasuo/EarthRovers_MLP-BC_nLoc153_nH-1model· 10 dl10 dl
- 🤗lauriasuo/EarthRovers_MLP-BC_nLoc153_nH-1_inc_testmodel· 24 dl24 dl
- 🤗lauriasuo/EarthRovers_MLP-BC_nLoc64_nH16model· 10 dl10 dl
- 🤗lauriasuo/EarthRovers_MLP-BC_nLoc64_nH8model· 10 dl10 dl
- 🤗lauriasuo/EarthRovers_MLP-BC_nLoc64_nH4model· 10 dl10 dl
- 🤗lauriasuo/EarthRovers_MLP-BC_nLoc64_nH2model· 10 dl10 dl
- 🤗lauriasuo/EarthRovers_ViNT_nLoc32_nH32model· 11 dl11 dl
- 🤗lauriasuo/EarthRovers_NoMAD_nLoc32_nH32model· 11 dl11 dl
- 🤗lauriasuo/EarthRovers_MLP-BC-DINO3_nLoc32_nH32model· 7 dl7 dl
- 🤗lauriasuo/EarthRovers_DP-Unet_nLoc32_nH32model· 22 dl22 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotics and Sensor-Based Localization · Autonomous Vehicle Technology and Safety
