Global End-Effector Pose Control of an Underactuated Aerial Manipulator via Reinforcement Learning
Shlok Deshmukh, Javier Alonso-Mora, Sihao Sun

TL;DR
This paper presents a reinforcement learning-based control method for a lightweight, underactuated aerial manipulator to achieve precise end-effector positioning and orientation, demonstrating robustness and accuracy in real-world experiments.
Contribution
It introduces a novel reinforcement learning approach for controlling a lightweight, underactuated aerial manipulator's end-effector pose, combining simulation-trained policies with classical controllers.
Findings
Achieved centimeter-level position accuracy in flight.
Demonstrated degree-level orientation precision.
Maintained robust performance under external disturbances.
Abstract
Aerial manipulators, which combine robotic arms with multi-rotor drones, face strict constraints on arm weight and mechanical complexity. In this work, we study a lightweight 2-degree-of-freedom (DoF) arm mounted on a quadrotor via a differential mechanism, capable of full six-DoF end-effector pose control. While the minimal design enables simplicity and reduced payload, it also introduces challenges such as underactuation and sensitivity to external disturbances. To address these, we employ reinforcement learning, training a Proximal Policy Optimization (PPO) agent in simulation to generate feedforward commands for quadrotor acceleration and body rates, along with joint angle targets. These commands are tracked by an incremental nonlinear dynamic inversion (INDI) attitude controller and a PID joint controller, respectively. Flight experiments demonstrate centimeter-level position…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Control of Nonlinear Systems · Robot Manipulation and Learning · Teleoperation and Haptic Systems
