A Deep Reinforcement Learning Approach to Rare Event Estimation

Anthony Corso; Kyu-Young Kim; Shubh Gupta; Grace Gao; Mykel J.; Kochenderfer

arXiv:2211.12470·cs.LG·November 23, 2022

A Deep Reinforcement Learning Approach to Rare Event Estimation

Anthony Corso, Kyu-Young Kim, Shubh Gupta, Grace Gao, Mykel J., Kochenderfer

PDF

Open Access

TL;DR

This paper introduces two adaptive importance sampling algorithms based on reinforcement learning principles to efficiently estimate rare event probabilities in sequential systems, addressing scalability issues of prior methods.

Contribution

Develops scalable adaptive importance sampling algorithms for rare event estimation in sequential decision systems using reinforcement learning techniques.

Findings

01

Improved accuracy over baseline methods in control tasks.

02

Effective variance reduction with multiple importance sampling.

03

Algorithms applicable to both continuous and discrete action spaces.

Abstract

An important step in the design of autonomous systems is to evaluate the probability that a failure will occur. In safety-critical domains, the failure probability is extremely small so that the evaluation of a policy through Monte Carlo sampling is inefficient. Adaptive importance sampling approaches have been developed for rare event estimation but do not scale well to sequential systems with long horizons. In this work, we develop two adaptive importance sampling algorithms that can efficiently estimate the probability of rare events for sequential decision making systems. The basis for these algorithms is the minimization of the Kullback-Leibler divergence between a state-dependent proposal distribution and a target distribution over trajectories, but the resulting algorithms resemble policy gradient and value-based reinforcement learning. We apply multiple importance sampling to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Reliability and Analysis Research · Safety Systems Engineering in Autonomy · Formal Methods in Verification