MDPFuzz: Testing Models Solving Markov Decision Processes
Qi Pang, Yuanyuan Yuan, Shuai Wang

TL;DR
MDPFuzz is a novel blackbox fuzz testing framework that identifies dangerous states in models solving MDPs, revealing hidden vulnerabilities and improving their robustness in safety-critical applications.
Contribution
This paper introduces MDPFuzz, the first framework for blackbox testing of MDP-solving models, using innovative techniques to detect and repair abnormal states.
Findings
Over 80 crash-triggering states found per model
Crash states induce distinct neuron activation patterns
Model robustness significantly improved after repairs
Abstract
The Markov decision process (MDP) provides a mathematical framework for modeling sequential decision-making problems, many of which are crucial to security and safety, such as autonomous driving and robot control. The rapid development of artificial intelligence research has created efficient methods for solving MDPs, such as deep neural networks (DNNs), reinforcement learning (RL), and imitation learning (IL). However, these popular models solving MDPs are neither thoroughly tested nor rigorously reliable. We present MDPFuzz, the first blackbox fuzz testing framework for models solving MDPs. MDPFuzz forms testing oracles by checking whether the target model enters abnormal and dangerous states. During fuzzing, MDPFuzz decides which mutated state to retain by measuring if it can reduce cumulative rewards or form a new state sequence. We design efficient techniques to quantify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference
MethodsRepair
