DoShiCo Challenge: Domain Shift in Control Prediction
Klaas Kelchtermans, Tinne Tuytelaars

TL;DR
This paper introduces the DoShiCo challenge, a benchmark for training control policies in simple synthetic environments that generalize to realistic scenarios, focusing on drone collision avoidance and domain shift robustness.
Contribution
It proposes a new benchmark combining domain shift challenges in reinforcement learning and computer vision, and demonstrates a baseline method using depth prediction to improve transferability.
Findings
Baseline policy successfully avoids collisions in realistic environments
Depth prediction auxiliary task aids in overcoming domain shift
Benchmark encourages research on domain transfer in control tasks
Abstract
Training deep neural network policies end-to-end for real-world applications so far requires big demonstration datasets in the real world or big sets consisting of a large variety of realistic and closely related 3D CAD models. These real or virtual data should, moreover, have very similar characteristics to the conditions expected at test time. These stringent requirements and the time consuming data collection processes that they entail, are currently the most important impediment that keeps deep reinforcement learning from being deployed in real-world applications. Therefore, in this work we advocate an alternative approach, where instead of avoiding any domain shift by carefully selecting the training data, the goal is to learn a policy that can cope with it. To this end, we propose the DoShiCo challenge: to train a model in very basic synthetic environments, far from realistic, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Reinforcement Learning in Robotics
