Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement   Learning for Planned-Ahead Vision-and-Language Navigation

Xin Wang; Wenhan Xiong; Hongmin Wang; William Yang Wang

arXiv:1803.07729·cs.CV·July 27, 2018·22 cites

Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation

Xin Wang, Wenhan Xiong, Hongmin Wang, William Yang Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hybrid reinforcement learning approach combining model-free and model-based methods for vision-and-language navigation, significantly improving real-world performance and generalization over synthetic models.

Contribution

A novel planned-ahead hybrid RL model that integrates environment prediction with policy planning for improved real-world navigation.

Findings

01

Outperforms baseline models on Room-to-Room dataset

02

Achieves superior generalization to unseen environments

03

Effectively combines model-free and model-based RL for navigation

Abstract

Existing research studies on vision and language grounding for robot navigation focus on improving model-free deep reinforcement learning (DRL) models in synthetic environments. However, model-free DRL models do not consider the dynamics in the real-world environments, and they often fail to generalize to new scenes. In this paper, we take a radical approach to bridge the gap between synthetic studies and real-world practices---We propose a novel, planned-ahead hybrid reinforcement learning model that combines model-free and model-based reinforcement learning to solve a real-world vision-language navigation task. Our look-ahead module tightly integrates a look-ahead policy model with an environment model that predicts the next state and the reward. Experimental results suggest that our proposed method significantly outperforms the baselines and achieves the best on the real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

peteanderson80/Matterport3DSimulator
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics