Particle Swarm Optimization for Generating Interpretable Fuzzy   Reinforcement Learning Policies

Daniel Hein; Alexander Hentschel; Thomas Runkler; Steffen Udluft

arXiv:1610.05984·cs.NE·August 18, 2017

Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies

Daniel Hein, Alexander Hentschel, Thomas Runkler, Steffen Udluft

PDF

TL;DR

This paper introduces FPSRL, a novel method combining particle swarm optimization with fuzzy reinforcement learning, trained on world models derived from previous data, to generate interpretable policies without online learning.

Contribution

It is the first to connect self-organizing fuzzy controllers with model-based batch RL, enabling safe, offline policy training in systems with known dynamics.

Findings

01

High-performing fuzzy policies achieved on benchmark tasks

02

Effective in domains where online learning is unsafe or impractical

03

Demonstrates interpretability and efficiency of the proposed approach

Abstract

Fuzzy controllers are efficient and interpretable system controllers for continuous state and action spaces. To date, such controllers have been constructed manually or trained automatically either using expert-generated problem-specific cost functions or incorporating detailed knowledge about the optimal control strategy. Both requirements for automatic training processes are not found in most real-world reinforcement learning (RL) problems. In such applications, online learning is often prohibited for safety reasons because online learning requires exploration of the problem's dynamics during policy training. We introduce a fuzzy particle swarm reinforcement learning (FPSRL) approach that can construct fuzzy RL policies solely by training parameters on world models that simulate real system dynamics. These world models are created by employing an autonomous machine learning technique…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.