Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies
Daniel Hein, Alexander Hentschel, Thomas Runkler, Steffen Udluft

TL;DR
This paper introduces FPSRL, a novel method combining particle swarm optimization with fuzzy reinforcement learning, trained on world models derived from previous data, to generate interpretable policies without online learning.
Contribution
It is the first to connect self-organizing fuzzy controllers with model-based batch RL, enabling safe, offline policy training in systems with known dynamics.
Findings
High-performing fuzzy policies achieved on benchmark tasks
Effective in domains where online learning is unsafe or impractical
Demonstrates interpretability and efficiency of the proposed approach
Abstract
Fuzzy controllers are efficient and interpretable system controllers for continuous state and action spaces. To date, such controllers have been constructed manually or trained automatically either using expert-generated problem-specific cost functions or incorporating detailed knowledge about the optimal control strategy. Both requirements for automatic training processes are not found in most real-world reinforcement learning (RL) problems. In such applications, online learning is often prohibited for safety reasons because online learning requires exploration of the problem's dynamics during policy training. We introduce a fuzzy particle swarm reinforcement learning (FPSRL) approach that can construct fuzzy RL policies solely by training parameters on world models that simulate real system dynamics. These world models are created by employing an autonomous machine learning technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
