Subgraph-Oriented Testing for Deep Learning Libraries

Xiaoyuan Xie; Yan Song; Songqiang Chen; Jinfu Chen

arXiv:2412.06430·cs.SE·June 10, 2025

Subgraph-Oriented Testing for Deep Learning Libraries

Xiaoyuan Xie, Yan Song, Songqiang Chen, Jinfu Chen

PDF

Open Access

TL;DR

This paper introduces SORT, a subgraph-oriented testing approach for deep learning libraries like PyTorch, which improves bug detection by focusing on realistic API interaction patterns and inputs, outperforming existing methods.

Contribution

SORT is a novel testing method that uses frequent subgraphs of model computation graphs to generate realistic API interaction sequences and inputs, enhancing bug detection accuracy.

Findings

01

Achieves 100% valid input generation rate.

02

Detects more precision bugs than existing methods.

03

Reveals interaction-related bugs missed by single-API testing.

Abstract

Deep Learning (DL) libraries, such as PyTorch, are widely used for building and deploying DL models on various hardware platforms. Meanwhile, they are found to contain bugs that lead to incorrect calculation results and cause issues like non-convergence training and inaccurate prediction of DL models. Thus, many efforts have been made to test DL libraries and reveal bugs. However, existing DL library testing methods manifest limitations: model-level testing methods cause complexity in fault localization. Meanwhile, API-level testing methods often generate invalid inputs or primarily focus on extreme inputs that lead to crash failures; they also ignore testing realistic API interactions. These limitations may lead to missing detection of bugs, even in the frequently used APIs. To address these limitations, we propose SORT (Subgraph-Oriented Realistic Testing) to differential test DL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Machine Learning and Data Classification · Data Quality and Management