Unlocking the Potential of Simulators: Design with RL in Mind
Rika Antonova, Silvia Cruciani

TL;DR
This paper demonstrates that designing simple, RL-compatible simulators that model control and dynamics can outperform high-fidelity simulators in training policies for real-world robotic tasks, especially when key uncertainties are identified.
Contribution
The authors introduce a novel approach to simulator design that emphasizes control modeling and show its effectiveness in robotic policy learning, challenging the reliance on high-fidelity simulators.
Findings
Simple RL-compatible simulators outperform high-fidelity ones in policy transfer.
Modeling control and key uncertainties enables effective policy learning.
Exploiting phenomena like friction can improve real-world policy performance.
Abstract
Using Reinforcement Learning (RL) in simulation to construct policies useful in real life is challenging. This is often attributed to the sequential decision making aspect: inaccuracies in simulation accumulate over multiple steps, hence the simulated trajectories diverge from what would happen in reality. In our work we show the need to consider another important aspect: the mismatch in simulating control. We bring attention to the need for modeling control as well as dynamics, since oversimplifying assumptions about applying actions of RL policies could make the policies fail on real-world systems. We design a simulator for solving a pivoting task (of interest in Robotics) and demonstrate that even a simple simulator designed with RL in mind outperforms high-fidelity simulators when it comes to learning a policy that is to be deployed on a real robotic system. We show that a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Machine Learning and Algorithms
