Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Florian Tambon, Vahid Majdinasab, Amin Nikanjam, Foutse Khomh,, Giuliano Antonio

TL;DR
This paper introduces RLMutation, a framework for mutation testing in reinforcement learning, using fault taxonomies and heuristics to generate test cases, revealing how mutation killing definitions influence testing outcomes and enabling effective higher order mutations.
Contribution
It extends mutation testing to reinforcement learning by developing RLMutation, incorporating fault-based mutation operators and analyzing their impact on testing effectiveness.
Findings
Mutation killing definitions significantly affect mutation detection.
HOM can be generated with few test cases and operators.
Higher order mutations enhance RL testing capabilities.
Abstract
Testing Deep Learning (DL) systems is a complex task as they do not behave like traditional systems would, notably because of their stochastic nature. Nonetheless, being able to adapt existing testing techniques such as Mutation Testing (MT) to DL settings would greatly improve their potential verifiability. While some efforts have been made to extend MT to the Supervised Learning paradigm, little work has gone into extending it to Reinforcement Learning (RL) which is also an important component of the DL ecosystem but behaves very differently from SL. This paper builds on the existing approach of MT in order to propose a framework, RLMutation, for MT applied to RL. Notably, we use existing taxonomies of faults to build a set of mutation operators relevant to RL and use a simple heuristic to generate test cases for RL. This allows us to compare different mutation killing definitions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Adversarial Robustness in Machine Learning · Viral Infectious Diseases and Gene Expression in Insects
MethodsTest
