Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning
Hazim Alzorgan, Abolfazl Razi, Ata Jahangir Moshayedi

TL;DR
This paper presents a reinforcement learning approach for controlling an aerial manipulator system on a UAV, enabling precise trajectory tracking of the manipulator's end-effector for complex tasks in challenging environments.
Contribution
It introduces a novel Q-learning based control framework combining obstacle-aware motion planning and independent end-effector trajectory control for UAV manipulators.
Findings
Achieved 92% accuracy in trajectory tracking.
Developed a robust control strategy handling UAV motion uncertainties.
Enabled complex actuation tasks in risky environments.
Abstract
In this paper, we investigate the operation of an aerial manipulator system, namely an Unmanned Aerial Vehicle (UAV) equipped with a controllable arm with two degrees of freedom to carry out actuation tasks on the fly. Our solution is based on employing a Q-learning method to control the trajectory of the tip of the arm, also called end-effector. More specifically, we develop a motion planning model based on Time To Collision (TTC), which enables a quadrotor UAV to navigate around obstacles while ensuring the manipulator's reachability. Additionally, we utilize a model-based Q-learning model to independently track and control the desired trajectory of the manipulator's end-effector, given an arbitrary baseline trajectory for the UAV platform. Such a combination enables a variety of actuation tasks such as high-altitude welding, structural monitoring and repair, battery replacement,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Robotics and Sensor-Based Localization
MethodsQ-Learning
