Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning
Yi-Hsuan Hsiao, Wei-Tung Chen, Yun-Sheng Chang, Pulkit Agrawal and, YuFeng Chen

TL;DR
This paper develops a deep reinforcement learning controller for soft-actuated insect-scale micro aerial vehicles, enabling robust, zero-shot hovering flights despite system delays and uncertainties, with successful real-world deployment.
Contribution
It introduces a novel combined approach of behavior cloning with state-action re-matching and RL fine-tuning for controlling soft IMAVs, achieving first end-to-end deep RL flight.
Findings
Deep RL controller enables stable hovering flights.
First successful real-world deep RL flight on soft IMAVs.
Achieves 50-second hover with low positional error.
Abstract
Soft-actuated insect-scale micro aerial vehicles (IMAVs) pose unique challenges for designing robust and computationally efficient controllers. At the millimeter scale, fast robot dynamics (ms), together with system delay, model uncertainty, and external disturbances significantly affect flight performances. Here, we design a deep reinforcement learning (RL) controller that addresses system delay and uncertainties. To initialize this neural network (NN) controller, we propose a modified behavior cloning (BC) approach with state-action re-matching to account for delay and domain-randomized expert demonstration to tackle uncertainty. Then we apply proximal policy optimization (PPO) to fine-tune the policy during RL, enhancing performance and smoothing commands. In simulations, our modified BC substantially increases the mean reward compared to baseline BC; and RL with PPO improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomimetic flight and propulsion mechanisms · Micro and Nano Robotics · Fluid Dynamics and Turbulent Flows
MethodsEntropy Regularization · Proximal Policy Optimization
