A Deep Reinforcement Learning Approach to Rare Event Estimation
Anthony Corso, Kyu-Young Kim, Shubh Gupta, Grace Gao, Mykel J., Kochenderfer

TL;DR
This paper introduces two adaptive importance sampling algorithms based on reinforcement learning principles to efficiently estimate rare event probabilities in sequential systems, addressing scalability issues of prior methods.
Contribution
Develops scalable adaptive importance sampling algorithms for rare event estimation in sequential decision systems using reinforcement learning techniques.
Findings
Improved accuracy over baseline methods in control tasks.
Effective variance reduction with multiple importance sampling.
Algorithms applicable to both continuous and discrete action spaces.
Abstract
An important step in the design of autonomous systems is to evaluate the probability that a failure will occur. In safety-critical domains, the failure probability is extremely small so that the evaluation of a policy through Monte Carlo sampling is inefficient. Adaptive importance sampling approaches have been developed for rare event estimation but do not scale well to sequential systems with long horizons. In this work, we develop two adaptive importance sampling algorithms that can efficiently estimate the probability of rare events for sequential decision making systems. The basis for these algorithms is the minimization of the Kullback-Leibler divergence between a state-dependent proposal distribution and a target distribution over trajectories, but the resulting algorithms resemble policy gradient and value-based reinforcement learning. We apply multiple importance sampling to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Safety Systems Engineering in Autonomy · Formal Methods in Verification
