Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors

Aditya M. Deshpande; Rumit Kumar; Ali A. Minai; Manish Kumar

arXiv:2007.07793 · cs.RO · July 16, 2020

TL;DR

This paper introduces a reinforcement learning-based control policy for a tilt-rotor quadcopter, leveraging transfer learning from a simpler UAV to improve learning speed, robustness, and fault tolerance in simulation tasks.

Contribution

It presents a novel developmental reinforcement learning approach that transfers policies from a simple quadcopter to a more complex tilt-rotor UAV, enhancing learning efficiency and robustness.

Findings

01. Faster learning of control policies compared to learning from scratch.

02. Demonstrated robustness in recovering from non-static initial conditions.

03. Superior fault tolerance of transferred policies over scratch-learned policies.

Abstract

In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors and uses both the magnitude and the direction of the rotor forces to achieve the desired state during flight. The control policy of this robot is learned by policy transfer from the learned controller of a conventional quadcopter (a comparatively simple UAV design without thrust vectoring). This approach allows learning a control policy for systems with multiple inputs and multiple outputs. The performance of the learned policy is evaluated in physics-based simulations on hovering and way-point navigation tasks. The flight simulations use a flight controller based on reinforcement learning without any additional PID components. The results show faster learning with the presented approach as opposed to…
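
The policy transfer described in the abstract can be pictured with a small sketch. The example below is an illustration only, not the authors' implementation. It assumes the source quadcopter policy is a feedforward network with four outputs (one per rotor thrust), that the tilt-rotor policy needs four extra outputs (one per rotor tilt), and that the new output rows start at zero so the transferred policy initially reproduces the source actions. The names init_mlp, forward and transfer, the state dimension, and the layer sizes are all hypothetical.

```python
import math
import random


def init_mlp(sizes, rng):
    """Build a feedforward net as a list of (weights, biases) layers."""
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
        layers.append((weights, [0.0] * n_out))
    return layers


def forward(layers, x):
    """Apply tanh hidden layers followed by a linear output layer."""
    for i, (weights, biases) in enumerate(layers):
        x = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
        if i < len(layers) - 1:
            x = [math.tanh(v) for v in x]
    return x


def transfer(source, n_extra):
    """Copy every source layer, then widen the output layer.

    Assumption: the extra outputs (rotor tilts here) start with zero
    weights and biases, so the target policy initially matches the source.
    """
    target = [([row[:] for row in w], b[:]) for w, b in source]
    out_w, out_b = target[-1]
    n_hidden = len(out_w[0])
    out_w.extend([0.0] * n_hidden for _ in range(n_extra))
    out_b.extend([0.0] * n_extra)
    return target


rng = random.Random(0)
STATE_DIM = 18  # hypothetical observation size, not taken from the paper

# Stand-in for a quadcopter policy that has already been trained.
source = init_mlp([STATE_DIM, 64, 64, 4], rng)

# Tilt-rotor policy: same hidden layers, four additional tilt outputs.
target = transfer(source, n_extra=4)

state = [rng.gauss(0.0, 1.0) for _ in range(STATE_DIM)]
a_src = forward(source, state)
a_tgt = forward(target, state)
print(len(a_src), len(a_tgt))
```

With this initialisation the first four target actions equal the source actions and the tilt actions are zero, so further training (for example with Proximal Policy Optimization, listed under Methods) would start from hover-capable behaviour rather than from random weights. Whether the paper initialises the added outputs this way is not stated in the text above.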

Code & Models

Repositories

adipandas/gym_multirotor
PyTorch · Official

Taxonomy

Methods: Adam · Dense Connections · Feedforward Network · Proximal Policy Optimization