Testing in the Evolving World of DL Systems:Insights from Python GitHub   Projects

Qurban Ali; Oliviero Riganelli; Leonardo Mariani

arXiv:2405.19976·cs.SE·October 24, 2024

Testing in the Evolving World of DL Systems:Insights from Python GitHub Projects

Qurban Ali, Oliviero Riganelli, Leonardo Mariani

PDF

Open Access 1 Repo

TL;DR

This paper analyzes testing practices in open-source Deep Learning projects on GitHub, highlighting the adoption, types, and evolution of testing methodologies to improve project quality and reliability.

Contribution

It provides the first comprehensive analysis of testing practices and their evolution in GitHub-hosted DL projects, offering valuable insights for practitioners.

Findings

01

Testing adoption is increasing in DL projects.

02

Unit and integration tests are most common.

03

Test suite growth correlates with project maturity.

Abstract

In the ever-evolving field of Deep Learning (DL), ensuring project quality and reliability remains a crucial challenge. This research investigates testing practices within DL projects in GitHub. It quantifies the adoption of testing methodologies, focusing on aspects like test automation, the types of tests (e.g., unit, integration, and system), test suite growth rate, and evolution of testing practices across different project versions. We analyze a subset of 300 carefully selected repositories based on quantitative and qualitative criteria. This study reports insights on the prevalence of testing practices in DL projects within the open-source community.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lakhanqurban/pdltesting
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Physics and Python Applications · Software Testing and Debugging Techniques · Software System Performance and Reliability