DROID: Minimizing the Reality Gap using Single-Shot Human Demonstration
Ya-Yen Tsai, Hui Xu, Zihan Ding, Chong Zhang, Edward Johns, Bidan, Huang

TL;DR
This paper introduces DROID, a framework that uses a single human demonstration to identify and optimize the simulator's dynamics parameters, improving the transfer of policies from simulation to real-world tasks like door opening.
Contribution
DROID is a novel method that leverages single-shot human demonstrations to better match simulation dynamics with reality, enhancing policy transfer and generalization.
Findings
DROID accurately identifies the dynamics discrepancy between simulation and real world.
Optimizing simulator parameters improves policy transfer success.
The method generalizes to related tasks beyond door opening.
Abstract
Reinforcement learning (RL) has demonstrated great success in the past several years. However, most of the scenarios focus on simulated environments. One of the main challenges of transferring the policy learned in a simulated environment to real world, is the discrepancy between the dynamics of the two environments. In prior works, Domain Randomization (DR) has been used to address the reality gap for both robotic locomotion and manipulation tasks. In this paper, we propose Domain Randomization Optimization IDentification (DROID), a novel framework to exploit single-shot human demonstration for identifying the simulator's distribution of dynamics parameters, and apply it to training a policy on a door opening task. Our results show that the proposed framework can identify the difference in dynamics between the simulated and the real worlds, and thus improve policy transfer by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Locomotion and Control · Model Reduction and Neural Networks
