Understanding Bugs in Multi-Language Deep Learning Frameworks

Zengyang Li; Sicheng Wang; Wenshuo Wang; Peng Liang; Ran Mo; Bing Li

arXiv:2303.02695·cs.SE·March 7, 2023·1 cites

Understanding Bugs in Multi-Language Deep Learning Frameworks

Zengyang Li, Sicheng Wang, Wenshuo Wang, Peng Liang, Ran Mo, Bing Li

PDF

Open Access

TL;DR

This study analyzes nearly 1500 bugs in multi-language deep learning frameworks, classifying bug types, assessing their impacts, and highlighting the complexity and prevalence of multi-programming-language bugs to improve bug management.

Contribution

It provides a comprehensive manual analysis of bugs in three major MPL deep learning frameworks, revealing bug types, impacts, and the complexity of MPL bug fixes, which was previously underexplored.

Findings

01

28.6%, 31.4%, and 16.0% of bugs are MPL bugs in MXNet, PyTorch, and TensorFlow.

02

MPL bug fixes have higher code change complexity than SPL bug fixes.

03

MPL bug fixes in PyTorch have longer open times and greater communication complexity.

Abstract

Deep learning frameworks (DLFs) have been playing an increasingly important role in this intelligence age since they act as a basic infrastructure for an increasingly wide range of AIbased applications. Meanwhile, as multi-programming-language (MPL) software systems, DLFs are inevitably suffering from bugs caused by the use of multiple programming languages (PLs). Hence, it is of paramount significance to understand the bugs (especially the bugs involving multiple PLs, i.e., MPL bugs) of DLFs, which can provide a foundation for preventing, detecting, and resolving bugs in the development of DLFs. To this end, we manually analyzed 1497 bugs in three MPL DLFs, namely MXNet, PyTorch, and TensorFlow. First, we classified bugs in these DLFs into 12 types (e.g., algorithm design bugs and memory bugs) according to their bug labels and characteristics. Second, we further explored the impacts of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Adversarial Robustness in Machine Learning · Software Testing and Debugging Techniques

MethodsSemi-Pseudo-Label