Robust Deep Reinforcement Learning for Quadcopter Control

Aditya M. Deshpande; Ali A. Minai; Manish Kumar

arXiv:2111.03915·cs.RO·November 9, 2021

Robust Deep Reinforcement Learning for Quadcopter Control

Aditya M. Deshpande, Ali A. Minai, Manish Kumar

PDF

Open Access 1 Repo

TL;DR

This paper introduces a robust deep reinforcement learning approach for quadcopter control that enhances policy transferability across varying environments by integrating Robust Markov Decision Processes, leading to improved generalization and adaptability.

Contribution

It proposes a novel robust RL method using RMDP for drone control, improving transferability and robustness over standard RL policies.

Findings

01

Robust policies outperform standard agents in unseen environments.

02

Increased robustness enhances generalization to non-stationary environments.

03

Method demonstrates effective transfer from simulation to varied test conditions.

Abstract

Deep reinforcement learning (RL) has made it possible to solve complex robotics problems using neural networks as function approximators. However, the policies trained on stationary environments suffer in terms of generalization when transferred from one environment to another. In this work, we use Robust Markov Decision Processes (RMDP) to train the drone control policy, which combines ideas from Robust Control and RL. It opts for pessimistic optimization to handle potential gaps between policy transfer from one environment to another. The trained control policy is tested on the task of quadcopter positional control. RL agents were trained in a MuJoCo simulator. During testing, different environment parameters (unseen during the training) were used to validate the robustness of the trained policy for transfer from one environment to another. The robust policy outperformed the standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adipandas/gym_multirotor
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · UAV Applications and Optimization