Search-Based Testing of Reinforcement Learning
Martin Tappler, Filip Cano C\'ordoba, Bernhard K. Aichernig and, Bettina K\"onighofer

TL;DR
This paper introduces a search-based testing framework for deep reinforcement learning that assesses safety and robustness by identifying safety-critical states and generating diverse test traces, demonstrated on Super Mario Bros.
Contribution
It presents a novel search-based testing method for deep RL that evaluates safety and performance through boundary state analysis and fuzz testing.
Findings
Effectively identifies safety-critical boundary states.
Generates diverse test traces for robustness evaluation.
Provides insights into RL agent safety and performance.
Abstract
Evaluation of deep reinforcement learning (RL) is inherently challenging. Especially the opaqueness of learned policies and the stochastic nature of both agents and environments make testing the behavior of deep RL agents difficult. We present a search-based testing framework that enables a wide range of novel analysis capabilities for evaluating the safety and performance of deep RL agents. For safety testing, our framework utilizes a search algorithm that searches for a reference trace that solves the RL task. The backtracking states of the search, called boundary states, pose safety-critical situations. We create safety test-suites that evaluate how well the RL agent escapes safety-critical situations near these boundary states. For robust performance testing, we create a diverse set of traces via fuzz testing. These fuzz traces are used to bring the agent into a wide variety of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Sports Analytics and Performance · Artificial Intelligence in Games
