Learning Quadrotor Control From Visual Features Using Differentiable Simulation
Johannes Heeg, Yunlong Song, Davide Scaramuzza

TL;DR
This paper demonstrates that differentiable simulation can significantly improve sample efficiency and training speed in learning quadrotor control from visual features, outperforming traditional model-free RL methods.
Contribution
It introduces a differentiable simulation approach for quadrotor control that combines surrogate models and state representation learning to enhance training efficiency and effectiveness.
Findings
Differentiable simulation outperforms model-free RL in sample efficiency.
Training time is reduced from hours to minutes.
Combining state representation learning accelerates convergence.
Abstract
The sample inefficiency of reinforcement learning (RL) remains a significant challenge in robotics. RL requires large-scale simulation and can still cause long training times, slowing research and innovation. This issue is particularly pronounced in vision-based control tasks where reliable state estimates are not accessible. Differentiable simulation offers an alternative by enabling gradient back-propagation through the dynamics model, providing low-variance analytical policy gradients and, hence, higher sample efficiency. However, its usage for real-world robotic tasks has yet been limited. This work demonstrates the great potential of differentiable simulation for learning quadrotor control. We show that training in differentiable simulation significantly outperforms model-free RL in terms of both sample efficiency and training time, allowing a policy to learn to recover a quadrotor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Autonomous Vehicle Technology and Safety
