Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty

Xu Wan; Chao Yang; Cheng Yang; Jie Song; Mingyang Sun

arXiv:2602.20729·cs.LG·March 17, 2026

Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty

Xu Wan, Chao Yang, Cheng Yang, Jie Song, Mingyang Sun

PDF

Open Access

TL;DR

Fuz-RL introduces a fuzzy measure-guided framework for safe reinforcement learning that enhances robustness and safety by estimating value functions with a novel fuzzy Bellman operator, avoiding complex min-max optimization.

Contribution

The paper presents a new fuzzy Bellman operator and proves its equivalence to distributionally robust safe RL, improving safety and performance under uncertainty without min-max optimization.

Findings

01

Fuz-RL effectively improves safety in various uncertain environments.

02

It integrates seamlessly with existing safe RL methods.

03

Empirical results show significant safety and control performance gains.

Abstract

Safe Reinforcement Learning (RL) is crucial for achieving high performance while ensuring safety in real-world applications. However, the complex interplay of multiple uncertainty sources in real environments poses significant challenges for interpretable risk assessment and robust decision-making. To address these challenges, we propose Fuz-RL, a fuzzy measure-guided robust framework for safe RL. Specifically, our framework develops a novel fuzzy Bellman operator for estimating robust value functions using Choquet integrals. Theoretically, we prove that solving the Fuz-RL problem (in Constrained Markov Decision Process (CMDP) form) is equivalent to solving distributionally robust safe RL problems (in robust CMDP form), effectively avoiding min-max optimization. Empirical analyses on safe-control-gym and safety-gymnasium scenarios demonstrate that Fuz-RL effectively integrates with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Infrastructure Resilience and Vulnerability Analysis