# Efficient Autonomy Validation in Simulation with Adaptive Stress Testing

**Authors:** Mark Koren, Mykel Kochenderfer

arXiv: 1907.06795 · 2019-07-17

## TL;DR

This paper introduces an improved adaptive stress testing method using recurrent neural networks to efficiently and robustly identify failure scenarios in autonomous systems, especially in complex, continuous initial condition spaces.

## Contribution

The authors develop a recurrent neural network-based solver for adaptive stress testing, enabling generalization across continuous initial conditions and reducing computational complexity.

## Key findings

- Successfully identified failure scenarios in autonomous driving simulations
- Reduced computational complexity compared to previous methods
- Demonstrated effectiveness in complex, realistic scenarios

## Abstract

During the development of autonomous systems such as driverless cars, it is important to characterize the scenarios that are most likely to result in failure. Adaptive Stress Testing (AST) provides a way to search for the most-likely failure scenario as a Markov decision process (MDP). Our previous work used a deep reinforcement learning (DRL) solver to identify likely failure scenarios. However, the solver's use of a feed-forward neural network with a discretized space of possible initial conditions poses two major problems. First, the system is not treated as a black box, in that it requires analyzing the internal state of the system, which leads to considerable implementation complexities. Second, in order to simulate realistic settings, a new instance of the solver needs to be run for each initial condition. Running a new solver for each initial condition not only significantly increases the computational complexity, but also disregards the underlying relationship between similar initial conditions. We provide a solution to both problems by employing a recurrent neural network that takes a set of initial conditions from a continuous space as input. This approach enables robust and efficient detection of failures because the solution generalizes across the entire space of initial conditions. By simulating an instance where an autonomous car drives while a pedestrian is crossing a road, we demonstrate the solver is now capable of finding solutions for problems that would have previously been intractable.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.06795/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1907.06795/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/1907.06795/full.md

---
Source: https://tomesphere.com/paper/1907.06795