TL;DR
This paper introduces an end-to-end reinforcement learning approach using high-fidelity differentiable simulation to enable quadrotors to perform obstacle avoidance at high speeds with stable, safe, and efficient control directly from depth images.
Contribution
The authors propose a novel low-level control policy trained via differentiable simulation that directly maps depth images to bodyrate commands, improving flight stability and generalization.
Findings
Achieves the highest success rate in obstacle avoidance benchmarks.
Demonstrates stable flight at speeds up to 7.5 m/s in outdoor environments.
Successfully deploys zero-shot in unseen, dense forest environments.
Abstract
Obstacle avoidance is a fundamental vision-based task essential for enabling quadrotors to perform advanced applications. When planning the trajectory, existing approaches both on optimization and learning typically regard quadrotor as a point-mass model, giving path or velocity commands then tracking the commands by outer-loop controller. However, at high speeds, planned trajectories sometimes become dynamically infeasible in actual flight, which beyond the capacity of controller. In this paper, we propose a novel end-to-end policy that directly maps depth images to low-level bodyrate commands by reinforcement learning via differentiable simulation. The high-fidelity simulation in training after parameter identification significantly reduces all the gaps between training, simulation and real world. Analytical process by differentiable simulation provides accurate gradient to ensure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
