Reinforcement Control with Hierarchical Backpropagated Adaptive Critics
John W. Jameson

TL;DR
This paper introduces a hierarchical reinforcement control architecture using two Backpropagated Adaptive Critics (BACs) to improve credit assignment over long time horizons in dynamic systems requiring frequent control updates.
Contribution
It proposes a novel hierarchical control framework with two BACs and introduces Response Induction Learning for emergent low-level responses during learning.
Findings
Hierarchical BAC architecture enhances control stability over long periods.
Response Induction Learning enables adaptive low-level responses.
The approach improves credit assignment in complex dynamical systems.
Abstract
Present incremental learning methods are limited in the ability to achieve reliable credit assignment over a large number time steps (or events). However, this situation is typical for cases where the dynamical system to be controlled requires relatively frequent control updates in order to maintain stability or robustness yet has some action-consequences which must be established over relatively long periods of time. To address this problem, the learning capabilities of a control architecture comprised of two Backpropagated Adaptive Critics (BACs) in a two-level hierarchy with continuous actions are explored. The high-level BAC updates less frequently than the low-level BAC and controls the latter to some degree. The response of the low-level to high-level signals can either be determined a priori or it can emerge during learning. A general approach called Response Induction Learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Advanced Control Systems Optimization
