Graph-Based Fuzz Testing for Deep Learning Inference Engine
Weisi Luo, Dong Chai, Xiaoyue Run, Jiang Wang, Chunrong Fang, Zhenyu, Chen

TL;DR
This paper introduces a graph-based fuzz testing approach for deep learning inference engines that uses operator-level coverage and Monte Carlo Tree Search to generate diversified models, uncovering numerous exceptions and improving testing effectiveness.
Contribution
It presents a novel graph-structured fuzz testing method with operator-level coverage and MCTS-driven model generation for DL inference engines, which was not addressed before.
Findings
Outperforms random methods in coverage and exception detection.
Discovered over 40 different exceptions in inference engines.
Achieved up to 8.2% increase in operator-level coverage.
Abstract
With the wide use of Deep Learning (DL) systems, academy and industry begin to pay attention to their quality. Testing is one of the major methods of quality assurance. However, existing testing techniques focus on the quality of DL models but lacks attention to the core underlying inference engines (i.e., frameworks and libraries). Inspired by the success stories of fuzz testing, we design a graph-based fuzz testing method to improve the quality of DL inference engines. This method is naturally followed by the graph structure of DL models. A novel operator-level coverage criterion based on graph theory is introduced and six different mutations are implemented to generate diversified DL models by exploring combinations of model structures, parameters, and data inputs. The Monte Carlo Tree Search (MCTS) is used to drive DL model generation without a training process. The experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
