TL;DR
This paper introduces a reinforcement learning-based control policy for a tilt-rotor quadcopter, leveraging transfer learning from a simpler UAV to improve learning speed, robustness, and fault tolerance in simulation tasks.
Contribution
It presents a novel developmental reinforcement learning approach that transfers policies from a simple quadcopter to a more complex tilt-rotor UAV, enhancing learning efficiency and robustness.
Findings
Faster learning of control policies compared to learning from scratch.
Demonstrated robustness in recovering from non-static initial conditions.
Superior fault tolerance of transferred policies over scratch-learned policies.
Abstract
In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadcopter (comparatively simple UAV design without thrust vectoring). This approach allows learning a control policy for systems with multiple inputs and multiple outputs. The performance of the learned policy is evaluated by physics-based simulations for the tasks of hovering and way-point navigation. The flight simulations utilize a flight controller based on reinforcement learning without any additional PID components. The results show faster learning with the presented approach as opposed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAdam · Dense Connections · Feedforward Network · Proximal Policy Optimization
