Extreme Risk Mitigation in Reinforcement Learning using Extreme Value   Theory

Karthik Somayaji NS; Yu Wang; Malachi Schram; Jan Drgona; Mahantesh; Halappanavar; Frank Liu; Peng Li

arXiv:2308.13011·cs.LG·August 30, 2023

Extreme Risk Mitigation in Reinforcement Learning using Extreme Value Theory

Karthik Somayaji NS, Yu Wang, Malachi Schram, Jan Drgona, Mahantesh, Halappanavar, Frank Liu, Peng Li

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel risk-sensitive reinforcement learning approach that employs extreme value theory to better model rare, catastrophic risk events, improving resilience and outperforming existing methods on benchmark tasks.

Contribution

It proposes a new EVT-based parameterization of the value distribution to enhance modeling of rare risky events in RL, with theoretical and empirical validation.

Findings

01

Outperforms existing risk-averse RL algorithms on benchmarks

02

Effectively models rare catastrophic events

03

Provides theoretical advantages of EVT-based distributions

Abstract

Risk-sensitive reinforcement learning (RL) has garnered significant attention in recent years due to the growing interest in deploying RL agents in real-world scenarios. A critical aspect of risk awareness involves modeling highly rare risk events (rewards) that could potentially lead to catastrophic outcomes. These infrequent occurrences present a formidable challenge for data-driven methods aiming to capture such risky events accurately. While risk-aware RL techniques do exist, their level of risk aversion heavily relies on the precision of the state-action value function estimation when modeling these rare occurrences. Our work proposes to enhance the resilience of RL agents when faced with very rare and risky events by focusing on refining the predictions of the extreme values predicted by the state-action value function distribution. To achieve this, we formulate the extreme values…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Extreme Risk Mitigation in Reinforcement Learning using Extreme Value Theory· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning