Analysis of Model-Free Reinforcement Learning Control Schemes on   self-balancing Wheeled Extendible System

Kanishk .; Rushil Kumar; Vikas Rastogi; Ajeet Kumar

arXiv:2111.08389·cs.RO·March 16, 2022·1 cites

Analysis of Model-Free Reinforcement Learning Control Schemes on self-balancing Wheeled Extendible System

Kanishk ., Rushil Kumar, Vikas Rastogi, Ajeet Kumar

PDF

Open Access

TL;DR

This paper explores the application of deep reinforcement learning algorithms, specifically Deep Deterministic Policy Gradient and Proximal Policy Optimization, to control a self-balancing extendable wheeled system, demonstrating improved adaptability over traditional methods.

Contribution

It introduces RL-based control schemes for a complex nonlinear system and compares their performance with Model Predictive Control, highlighting their effectiveness and self-tuning capabilities.

Findings

01

RL controllers outperform MPC in trajectory tracking accuracy

02

Deep RL models adapt better to system dynamics

03

Self-tuning parameters improve control stability

Abstract

Traditional linear control strategies have been extensively researched and utilized in many robotic and industrial applications and yet they do not respond to the total dynamics of the systems. To avoid tedious calculations for nonlinear control schemes like H-infinity control and predictive control, the application of Reinforcement Learning(RL) can provide alternative solutions. This article presents the implementation of RL control with Deep Deterministic Policy Gradient and Proximal Policy Optimization on a mobile self-balancing Extendable Wheeled Inverted Pendulum (E-WIP) system with provided state history to attain improved control. Such RL models make the task of finding satisfactory control schemes easier and responding to the dynamics effectively while self-tuning the parameters to provide better control. In this article, RL-based controllers are pitted against an MPC controller…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Smart Grid Energy Management