MDPFuzz: Testing Models Solving Markov Decision Processes

Qi Pang; Yuanyuan Yuan; Shuai Wang

arXiv:2112.02807·cs.SE·April 13, 2023

MDPFuzz: Testing Models Solving Markov Decision Processes

Qi Pang, Yuanyuan Yuan, Shuai Wang

PDF

Open Access

TL;DR

MDPFuzz is a novel blackbox fuzz testing framework that identifies dangerous states in models solving MDPs, revealing hidden vulnerabilities and improving their robustness in safety-critical applications.

Contribution

This paper introduces MDPFuzz, the first framework for blackbox testing of MDP-solving models, using innovative techniques to detect and repair abnormal states.

Findings

01

Over 80 crash-triggering states found per model

02

Crash states induce distinct neuron activation patterns

03

Model robustness significantly improved after repairs

Abstract

The Markov decision process (MDP) provides a mathematical framework for modeling sequential decision-making problems, many of which are crucial to security and safety, such as autonomous driving and robot control. The rapid development of artificial intelligence research has created efficient methods for solving MDPs, such as deep neural networks (DNNs), reinforcement learning (RL), and imitation learning (IL). However, these popular models solving MDPs are neither thoroughly tested nor rigorously reliable. We present MDPFuzz, the first blackbox fuzz testing framework for models solving MDPs. MDPFuzz forms testing oracles by checking whether the target model enters abnormal and dangerous states. During fuzzing, MDPFuzz decides which mutated state to retain by measuring if it can reduce cumulative rewards or form a new state sequence. We design efficient techniques to quantify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference

MethodsRepair