Coverage Guided Testing for Recurrent Neural Networks

Wei Huang; Youcheng Sun; Xingyu Zhao; James Sharp; Wenjie Ruan; Jie; Meng; and Xiaowei Huang

arXiv:1911.01952·cs.LG·May 14, 2021

Coverage Guided Testing for Recurrent Neural Networks

Wei Huang, Youcheng Sun, Xingyu Zhao, James Sharp, Wenjie Ruan, Jie, Meng, and Xiaowei Huang

PDF

1 Repo

TL;DR

This paper introduces a coverage guided testing approach for LSTM-based RNNs, utilizing novel metrics and genetic algorithms to detect defects more effectively and interpretably than existing methods.

Contribution

It develops a new set of test metrics and a genetic algorithm for systematic RNN testing, with an implementation that outperforms state-of-the-art tools.

Findings

01

TestRNN outperforms DeepStellar and attack-based methods.

02

Finer temporal semantics improve defect detection.

03

Provides interpretable testing results.

Abstract

Recurrent neural networks (RNNs) have been applied to a broad range of applications, including natural language processing, drug discovery, and video recognition. Their vulnerability to input perturbation is also known. Aligning with a view from software defect detection, this paper aims to develop a coverage guided testing approach to systematically exploit the internal behaviour of RNNs, with the expectation that such testing can detect defects with high possibility. Technically, the long short term memory network (LSTM), a major class of RNNs, is thoroughly studied. A family of three test metrics are designed to quantify not only the values but also the temporal relations (including both step-wise and bounded-length) exhibited when LSTM processing inputs. A genetic algorithm is applied to efficiently generate test cases. The test metrics and test case generation algorithm are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TrustAI/DeepConcolic
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest · Sigmoid Activation · Tanh Activation · Long Short-Term Memory