Retro-RL: Reinforcing Nominal Controller With Deep Reinforcement   Learning for Tilting-Rotor Drones

I Made Aswin Nahrendra; Christian Tirtawardhana; Byeongho Yu,; Eungchang Mason Lee; Hyun Myung

arXiv:2207.03124·cs.RO·July 8, 2022

Retro-RL: Reinforcing Nominal Controller With Deep Reinforcement Learning for Tilting-Rotor Drones

I Made Aswin Nahrendra, Christian Tirtawardhana, Byeongho Yu,, Eungchang Mason Lee, Hyun Myung

PDF

Open Access

TL;DR

This paper introduces a hybrid control architecture for tilting-rotor drones that combines a nominal controller with a deep reinforcement learning policy, enhancing robustness and stability in complex tasks like wall climbing.

Contribution

It proposes an uncertainty-aware control mixer that integrates a learned policy with a nominal controller, ensuring stability and robustness for real-world drone applications.

Findings

01

The hybrid approach outperforms conventional controllers in real-world tests.

02

The method maintains stability while leveraging deep RL for robustness.

03

Extensive domain randomization improves policy generalization.

Abstract

Studies that broaden drone applications into complex tasks require a stable control framework. Recently, deep reinforcement learning (RL) algorithms have been exploited in many studies for robot control to accomplish complex tasks. Unfortunately, deep RL algorithms might not be suitable for being deployed directly into a real-world robot platform due to the difficulty in interpreting the learned policy and lack of stability guarantee, especially for a complex task such as a wall-climbing drone. This paper proposes a novel hybrid architecture that reinforces a nominal controller with a robust policy learned using a model-free deep RL algorithm. The proposed architecture employs an uncertainty-aware control mixer to preserve guaranteed stability of a nominal controller while using the extended robust performance of the learned policy. The policy is trained in a simulated environment with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control