Subgraph-Oriented Testing for Deep Learning Libraries
Xiaoyuan Xie, Yan Song, Songqiang Chen, Jinfu Chen

TL;DR
This paper introduces SORT, a subgraph-oriented testing approach for deep learning libraries like PyTorch, which improves bug detection by focusing on realistic API interaction patterns and inputs, outperforming existing methods.
Contribution
SORT is a novel testing method that uses frequent subgraphs of model computation graphs to generate realistic API interaction sequences and inputs, enhancing bug detection accuracy.
Findings
Achieves 100% valid input generation rate.
Detects more precision bugs than existing methods.
Reveals interaction-related bugs missed by single-API testing.
Abstract
Deep Learning (DL) libraries, such as PyTorch, are widely used for building and deploying DL models on various hardware platforms. Meanwhile, they are found to contain bugs that lead to incorrect calculation results and cause issues like non-convergence training and inaccurate prediction of DL models. Thus, many efforts have been made to test DL libraries and reveal bugs. However, existing DL library testing methods manifest limitations: model-level testing methods cause complexity in fault localization. Meanwhile, API-level testing methods often generate invalid inputs or primarily focus on extreme inputs that lead to crash failures; they also ignore testing realistic API interactions. These limitations may lead to missing detection of bugs, even in the frequently used APIs. To address these limitations, we propose SORT (Subgraph-Oriented Realistic Testing) to differential test DL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Machine Learning and Data Classification · Data Quality and Management
