Validate on Sim, Detect on Real -- Model Selection for Domain Randomization
Gal Leibovich, Guy Jacob, Shadi Endrawis, Gal Novik, Aviv Tamar

TL;DR
This paper introduces VSDR, a method combining out-of-distribution detection and simulation evaluation to rank policies for sim2real transfer, reducing real-world testing and improving selection accuracy.
Contribution
The paper proposes VSDR, a novel policy scoring method that enhances policy ranking accuracy by integrating OOD detection with simulation evaluation, reducing reliance on costly real-world testing.
Findings
VSDR significantly improves policy ranking accuracy.
VSDR reduces the amount of real-world data needed.
VSDR outperforms baseline methods in robotic grasping tasks.
Abstract
A practical approach to learning robot skills, often termed sim2real, is to train control policies in simulation and then deploy them on a real robot. Popular techniques to improve the sim2real transfer build on domain randomization (DR) -- training the policy on a diverse set of randomly generated domains with the hope of better generalization to the real world. Due to the large number of hyper-parameters in both the policy learning and DR algorithms, one often ends up with a large number of trained policies, where choosing the best policy among them demands costly evaluation on the real robot. In this work we ask - can we rank the policies without running them in the real world? Our main idea is that a predefined set of real world data can be used to evaluate all policies, using out-of-distribution detection (OOD) techniques. In a sense, this approach can be seen as a `unit test' to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Topic Modeling
