TL;DR
This paper introduces occupancy anticipation, enabling agents to infer unobserved environment regions from egocentric RGB-D data, leading to faster spatial awareness and improved exploration and navigation performance in 3D environments.
Contribution
It presents a novel occupancy anticipation method that leverages context from egocentric views and top-down maps to predict unobserved areas, enhancing navigation efficiency.
Findings
Outperforms strong baselines in environment mapping.
Achieves superior exploration and navigation results on Gibson and Matterport3D datasets.
Won the 2020 Habitat PointNav Challenge.
Abstract
State-of-the-art navigation methods leverage a spatial memory to generalize to new environments, but their occupancy maps are limited to capturing the geometric structures directly observed by the agent. We propose occupancy anticipation, where the agent uses its egocentric RGB-D observations to infer the occupancy state beyond the visible regions. In doing so, the agent builds its spatial awareness more rapidly, which facilitates efficient exploration and navigation in 3D environments. By exploiting context in both the egocentric views and top-down maps our model successfully anticipates a broader map of the environment, with performance significantly better than strong baselines. Furthermore, when deployed for the sequential decision-making tasks of exploration and navigation, our model outperforms state-of-the-art methods on the Gibson and Matterport3D datasets. Our approach is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
