Unlocking the Potential of Simulators: Design with RL in Mind

Rika Antonova; Silvia Cruciani

arXiv:1706.02501·cs.LG·June 9, 2017·1 cites

Unlocking the Potential of Simulators: Design with RL in Mind

Rika Antonova, Silvia Cruciani

PDF

Open Access

TL;DR

This paper demonstrates that designing simple, RL-compatible simulators that model control and dynamics can outperform high-fidelity simulators in training policies for real-world robotic tasks, especially when key uncertainties are identified.

Contribution

The authors introduce a novel approach to simulator design that emphasizes control modeling and show its effectiveness in robotic policy learning, challenging the reliance on high-fidelity simulators.

Findings

01

Simple RL-compatible simulators outperform high-fidelity ones in policy transfer.

02

Modeling control and key uncertainties enables effective policy learning.

03

Exploiting phenomena like friction can improve real-world policy performance.

Abstract

Using Reinforcement Learning (RL) in simulation to construct policies useful in real life is challenging. This is often attributed to the sequential decision making aspect: inaccuracies in simulation accumulate over multiple steps, hence the simulated trajectories diverge from what would happen in reality. In our work we show the need to consider another important aspect: the mismatch in simulating control. We bring attention to the need for modeling control as well as dynamics, since oversimplifying assumptions about applying actions of RL policies could make the policies fail on real-world systems. We design a simulator for solving a pivoting task (of interest in Robotics) and demonstrate that even a simple simulator designed with RL in mind outperforms high-fidelity simulators when it comes to learning a policy that is to be deployed on a real robotic system. We show that a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Machine Learning and Algorithms