DoShiCo Challenge: Domain Shift in Control Prediction

Klaas Kelchtermans; Tinne Tuytelaars

arXiv:1710.09860·cs.RO·April 13, 2018

DoShiCo Challenge: Domain Shift in Control Prediction

Klaas Kelchtermans, Tinne Tuytelaars

PDF

Open Access

TL;DR

This paper introduces the DoShiCo challenge, a benchmark for training control policies in simple synthetic environments that generalize to realistic scenarios, focusing on drone collision avoidance and domain shift robustness.

Contribution

It proposes a new benchmark combining domain shift challenges in reinforcement learning and computer vision, and demonstrates a baseline method using depth prediction to improve transferability.

Findings

01

Baseline policy successfully avoids collisions in realistic environments

02

Depth prediction auxiliary task aids in overcoming domain shift

03

Benchmark encourages research on domain transfer in control tasks

Abstract

Training deep neural network policies end-to-end for real-world applications so far requires big demonstration datasets in the real world or big sets consisting of a large variety of realistic and closely related 3D CAD models. These real or virtual data should, moreover, have very similar characteristics to the conditions expected at test time. These stringent requirements and the time consuming data collection processes that they entail, are currently the most important impediment that keeps deep reinforcement learning from being deployed in real-world applications. Therefore, in this work we advocate an alternative approach, where instead of avoiding any domain shift by carefully selecting the training data, the goal is to learn a policy that can cope with it. To this end, we propose the DoShiCo challenge: to train a model in very basic synthetic environments, far from realistic, in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Reinforcement Learning in Robotics