Dynamic Reinforcement Learning for Actors
Katsunari Shibata

TL;DR
This paper introduces Dynamic Reinforcement Learning, which directly controls system dynamics to enhance exploration and adaptability in RL agents, representing a significant shift from traditional static policy control.
Contribution
The paper proposes a novel Dynamic RL framework that manipulates system dynamics via sensitivity control, enabling flexible exploration and improved reproducibility without external noise.
Findings
Effective on dynamic tasks without external noise
Enhanced adaptability to new environments
Controlled convergence and divergence of dynamics
Abstract
Dynamic Reinforcement Learning (Dynamic RL), proposed in this paper, directly controls system dynamics, instead of the actor (action-generating neural network) outputs at each moment, bringing about a major qualitative shift in reinforcement learning (RL) from static to dynamic. The actor is initially designed to generate chaotic dynamics through the loop with its environment, enabling the agent to perform flexible and deterministic exploration. Dynamic RL controls global system dynamics using a local index called "sensitivity," which indicates how much the input neighborhood contracts or expands into the corresponding output neighborhood through each neuron's processing. While sensitivity adjustment learning (SAL) prevents excessive convergence of the dynamics, sensitivity-controlled reinforcement learning (SRL) adjusts them -- to converge more to improve reproducibility around better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Advanced Research in Systems and Signal Processing
