DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems
Xiaoning Du, Xiaofei Xie, Yi Li, Lei Ma, Jianjun Zhao, Yang Liu

TL;DR
DeepCruiser introduces an automated testing framework for RNN-based stateful deep learning systems, addressing unique challenges in quality assurance by modeling state transitions and generating large-scale tests to uncover defects.
Contribution
This paper pioneers testing methodologies specifically for stateful RNN-based deep learning systems, including modeling, coverage criteria, and automated test generation.
Findings
Effective in uncovering defects in RNN-based systems
Improves quality and reliability of stateful DL systems
Demonstrated on a speech-to-text application
Abstract
Deep learning (DL) defines a data-driven programming paradigm that automatically composes the system decision logic from the training data. In company with the data explosion and hardware acceleration during the past decade, DL achieves tremendous success in many cutting-edge applications. However, even the state-of-the-art DL systems still suffer from quality and reliability issues. It was only until recently that some preliminary progress was made in testing feed-forward DL systems. In contrast to feed-forward DL systems, recurrent neural networks (RNN) follow a very different architectural design, implementing temporal behaviors and memory with loops and internal states. Such stateful nature of RNN contributes to its success in handling sequential inputs such as audio, natural languages and video processing, but also poses new challenges for quality assurance. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Software Testing and Debugging Techniques · Anomaly Detection Techniques and Applications
