Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning

Yi-Hsuan Hsiao, Wei-Tung Chen, Yun-Sheng Chang, Pulkit Agrawal, and YuFeng Chen

arXiv:2502.12355 · cs.RO · February 19, 2025

Open Access

TL;DR

This paper develops a deep reinforcement learning controller for soft-actuated insect-scale micro aerial vehicles, enabling robust, zero-shot hovering flights despite system delays and uncertainties, with successful real-world deployment.

Contribution

The paper introduces an approach that combines behavior cloning with state-action re-matching and RL fine-tuning to control soft-actuated IMAVs, achieving the first end-to-end deep RL flight on this class of robot.

Findings

01

Deep RL controller enables stable hovering flights.

02

First successful real-world deep RL flight on soft IMAVs.

03

Achieves 50-second hover with low positional error.

Abstract

Soft-actuated insect-scale micro aerial vehicles (IMAVs) pose unique challenges for designing robust and computationally efficient controllers. At the millimeter scale, fast robot dynamics ($\sim$ms), together with system delay, model uncertainty, and external disturbances, significantly affect flight performance. Here, we design a deep reinforcement learning (RL) controller that addresses system delay and uncertainties. To initialize this neural network (NN) controller, we propose a modified behavior cloning (BC) approach with state-action re-matching to account for delay and domain-randomized expert demonstration to tackle uncertainty. Then we apply proximal policy optimization (PPO) to fine-tune the policy during RL, enhancing performance and smoothing commands. In simulations, our modified BC substantially increases the mean reward compared to baseline BC, and RL with PPO improves…
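
The abstract does not spell out how state-action re-matching works, so the following is only a minimal sketch of one plausible reading: if a command takes `delay_steps` control steps to act on the robot, the state observed at step t is paired with the expert action recorded at step t + delay_steps before behavior cloning. The function name `rematch_state_action`, the `delay_steps` parameter, and the array layout are assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def rematch_state_action(states, actions, delay_steps):
    """Pair each state with the expert action issued delay_steps later.

    Hypothetical helper: an assumed interpretation of "state-action
    re-matching", not the method as defined in the paper.

    states:  array of shape (T, state_dim), one row per control step
    actions: array of shape (T, action_dim), expert action at each step
    """
    states = np.asarray(states)
    actions = np.asarray(actions)
    if len(states) != len(actions):
        raise ValueError("states and actions must have the same length")
    if not 0 <= delay_steps < len(states):
        raise ValueError("delay_steps must be in [0, T)")
    if delay_steps == 0:
        # No delay: the original pairing is kept unchanged.
        return states.copy(), actions.copy()
    # Drop the last delay_steps states and the first delay_steps actions,
    # so that row i holds (state at step i, action at step i + delay_steps).
    return states[:-delay_steps], actions[delay_steps:]


# Toy demonstration with a 5-step trajectory and an assumed 2-step delay.
demo_states = np.arange(5, dtype=float).reshape(5, 1)
demo_actions = 10.0 * np.arange(5, dtype=float).reshape(5, 1)
bc_states, bc_actions = rematch_state_action(demo_states, demo_actions, 2)
```

Under this assumption the re-matched dataset is shorter than the original trajectory by `delay_steps` samples, and the resulting pairs can be fed to any standard supervised BC loss.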

Taxonomy

Topics: Biomimetic flight and propulsion mechanisms · Micro and Nano Robotics · Fluid Dynamics and Turbulent Flows

Methods: Entropy Regularization · Proximal Policy Optimization