Bi-Level Reinforcement Learning Control for an Underactuated Blimp via Center-of-Mass Reconfiguration

Xiaorui Wang; Hongwu Wang; Yue Fan; Hao Cheng; Feitian Zhang

arXiv:2605.01289·cs.RO·May 5, 2026

Bi-Level Reinforcement Learning Control for an Underactuated Blimp via Center-of-Mass Reconfiguration

Xiaorui Wang, Hongwu Wang, Yue Fan, Hao Cheng, Feitian Zhang

PDF

TL;DR

This paper presents a bi-level reinforcement learning approach for controlling an underactuated blimp with a movable center-of-mass, improving energy efficiency and tracking accuracy through explicit CoM planning and thrust control.

Contribution

It introduces a novel bi-level RL framework that decouples CoM planning from thrust control for an underactuated blimp, supported by a two-stage learning strategy and convergence analysis.

Findings

01

Outperforms fixed-CoM and PID controllers in accuracy and robustness.

02

Enables reliable sim-to-real transfer for the blimp control.

03

Demonstrates effectiveness through extensive simulations and real-world experiments.

Abstract

This paper investigates goal-directed tracking control of underactuated blimps with center-of-mass (CoM) reconfiguration. Unlike conventional overactuated blimp designs that rely on redundant actuation for simplified control, this paper focuses on a compact architecture consisting of two thrusters and a movable internal slider, aiming to improve energy efficiency and payload capacity. This hardware-efficient configuration introduces significant underactuation and strong nonlinear coupling between CoM dynamics and vehicle motion. To address these challenges, this paper proposes a bi-level reinforcement learning framework that explicitly decouples task-level CoM planning from continuous thrust control. The outer policy determines a target-dependent CoM configuration prior to flight, while the inner policy generates thrust commands to track straight-line references. To ensure stable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.