A Data-Efficient Framework for Training and Sim-to-Real Transfer of   Navigation Policies

Homanga Bharadhwaj; Zihan Wang; Yoshua Bengio; Liam Paull

arXiv:1810.04871·cs.RO·October 12, 2018

A Data-Efficient Framework for Training and Sim-to-Real Transfer of Navigation Policies

Homanga Bharadhwaj, Zihan Wang, Yoshua Bengio, Liam Paull

PDF

TL;DR

This paper presents a data-efficient, simulation-based framework for training robot navigation policies that transfer effectively to real environments using meta-learning and adversarial domain adaptation.

Contribution

It introduces a novel planning framework with a gradient-based planner and a meta-learning training strategy, enhanced by adversarial domain transfer for sim-to-real navigation.

Findings

01

Successful transfer of navigation policies from simulation to real robots

02

Effective planning with fewer real-world demonstrations

03

Robust performance across different navigation tasks

Abstract

Learning effective visuomotor policies for robots purely from data is challenging, but also appealing since a learning-based system should not require manual tuning or calibration. In the case of a robot operating in a real environment the training process can be costly, time-consuming, and even dangerous since failures are common at the start of training. For this reason, it is desirable to be able to leverage \textit{simulation} and \textit{off-policy} data to the extent possible to train the robot. In this work, we introduce a robust framework that plans in simulation and transfers well to the real environment. Our model incorporates a gradient-descent based planning module, which, given the initial image and goal image, encodes the images to a lower dimensional latent state and plans a trajectory to reach the goal. The model, consisting of the encoder and planner modules, is trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.