Comparison of Model Predictive Control and Proximal Policy Optimization for a 1-DOF Helicopter System

Georg Schäfer; Jakob Rehrl; Stefan Huber; Simon Hirlaender

arXiv:2408.15633·eess.SY·August 29, 2024

Open Access

TL;DR

This paper compares Model Predictive Control and Proximal Policy Optimization, a Deep Reinforcement Learning method, on a 1-DOF helicopter system, analyzing their performance, computational demands, and suitability for different control tasks.

Contribution

It provides a systematic comparison of MPC and PPO on a 1-DOF helicopter, highlighting their respective strengths, limitations, and practical considerations for control applications.

Findings

1. PPO shows superior rise-time and adaptability.

2. LQR achieves the best steady-state accuracy.

3. PPO offers promising rapid response capabilities.

Abstract

This study conducts a comparative analysis of Model Predictive Control (MPC) and Proximal Policy Optimization (PPO), a Deep Reinforcement Learning (DRL) algorithm, applied to a 1-Degree of Freedom (DOF) Quanser Aero 2 system. Classical control techniques such as MPC and Linear Quadratic Regulator (LQR) are widely used due to their theoretical foundation and practical effectiveness. However, with advancements in computational techniques and machine learning, DRL approaches like PPO have gained traction in solving optimal control problems through environment interaction. This paper systematically evaluates the dynamic response characteristics of PPO and MPC, comparing their performance, computational resource consumption, and implementation complexity. Experimental results show that while LQR achieves the best steady-state accuracy, PPO excels in rise-time and adaptability, making it a…
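The abstract compares the controllers by rise-time and steady-state accuracy. As a rough illustration of how such metrics can be computed from a recorded step response, here is a minimal sketch. The function names, the 10%-90% rise-time convention, the averaging window, and the synthetic first-order response are all assumptions made for this example; they are not taken from the paper, which may define its metrics differently.

```python
import numpy as np


def rise_time(t, y, y_ref, lo=0.1, hi=0.9):
    """Time between the first crossings of lo*y_ref and hi*y_ref.

    Assumes a positive step from 0 to y_ref (hypothetical convention).
    Returns NaN if either threshold is never reached.
    """
    t = np.asarray(t, dtype=float)
    y = np.asarray(y, dtype=float)
    above_lo = np.nonzero(y >= lo * y_ref)[0]
    above_hi = np.nonzero(y >= hi * y_ref)[0]
    if above_lo.size == 0 or above_hi.size == 0:
        return float("nan")
    return t[above_hi[0]] - t[above_lo[0]]


def steady_state_error(y, y_ref, tail_fraction=0.1):
    """Absolute error between y_ref and the mean of the last samples.

    tail_fraction is an assumed averaging window, not a value from the paper.
    """
    y = np.asarray(y, dtype=float)
    n_tail = max(1, int(len(y) * tail_fraction))
    return abs(y_ref - y[-n_tail:].mean())


# Synthetic first-order step response (illustrative data only,
# not measurements from the Quanser Aero 2).
t = np.linspace(0.0, 10.0, 1001)
y = 1.0 - np.exp(-t)

tr = rise_time(t, y, y_ref=1.0)
ess = steady_state_error(y, y_ref=1.0)
print(f"rise time: {tr:.2f} s, steady-state error: {ess:.1e}")
```

For a first-order response with a unit time constant, the 10%-90% rise time is ln(9), about 2.2 s, which gives a quick sanity check on the helper. On real pitch-angle logs, the same two functions could be applied to each controller's trace to obtain comparable numbers.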

Taxonomy

Topics: Advanced Control Systems Optimization · Fault Detection and Control Systems · Control Systems and Identification

Methods: Entropy Regularization · Proximal Policy Optimization