Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
Jacob Krantz, Stefan Lee

TL;DR
This paper investigates transferring agents from abstract topological environments to realistic continuous 3D environments in Vision-and-Language Navigation, achieving significant success rate improvements and analyzing the causes of performance gaps.
Contribution
It introduces a sim-2-sim transfer approach that enhances VLN-CE performance and provides insights into the challenges of transferring between different environmental paradigms.
Findings
Sim-2-sim transfer improves success rate by +12%.
Transfer does not fully preserve original performance.
Identifies key differences affecting transfer effectiveness.
Abstract
Recent work in Vision-and-Language Navigation (VLN) has presented two environmental paradigms with differing realism -- the standard VLN setting built on topological environments where navigation is abstracted away, and the VLN-CE setting where agents must navigate continuous 3D environments using low-level actions. Despite sharing the high-level task and even the underlying instruction-path data, performance on VLN-CE lags behind VLN significantly. In this work, we explore this gap by transferring an agent from the abstract environment of VLN to the continuous environment of VLN-CE. We find that this sim-2-sim transfer is highly effective, improving over the prior state of the art in VLN-CE by +12% success rate. While this demonstrates the potential for this direction, the transfer does not fully retain the original performance of the agent in the abstract setting. We present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques
