Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement   Learning Control

Donghe Chen; Yubin Peng; Tengjie Zheng; Han Wang; Chaoran Qu; Lin; Cheng

arXiv:2502.02265·cs.LG·February 5, 2025

Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control

Donghe Chen, Yubin Peng, Tengjie Zheng, Han Wang, Chaoran Qu, Lin, Cheng

PDF

Open Access

TL;DR

This paper introduces Adviser-Actor-Critic (AAC), a reinforcement learning method that combines feedback control with adaptive learning to improve precision and eliminate steady-state error in control tasks, especially in robotics.

Contribution

AAC innovatively integrates feedback control with reinforcement learning, using an Adviser to mentor the actor for enhanced goal precision and robustness.

Findings

01

AAC outperforms standard RL algorithms in precision-critical tasks

02

Demonstrates high accuracy and robustness in goal attainment

03

Effective in robotics and real-world applications

Abstract

High-precision control tasks present substantial challenges for reinforcement learning (RL) algorithms, frequently resulting in suboptimal performance attributed to network approximation inaccuracies and inadequate sample quality.These issues are exacerbated when the task requires the agent to achieve a precise goal state, as is common in robotics and other real-world applications.We introduce Adviser-Actor-Critic (AAC), designed to address the precision control dilemma by combining the precision of feedback control theory with the adaptive learning capability of RL and featuring an Adviser that mentors the actor to refine control actions, thereby enhancing the precision of goal attainment.Finally, through benchmark tests, AAC outperformed standard RL algorithms in precision-critical, goal-conditioned tasks, demonstrating AAC's high precision, reliability, and robustness.Code are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making