From Code Changes to Quality Gains: An Empirical Study in Python ML Systems with PyQu

Mohamed Almukhtar; Anwar Ghammam; Marouane Kessentini; Hua Ming

arXiv:2511.02827·cs.SE·April 2, 2026

From Code Changes to Quality Gains: An Empirical Study in Python ML Systems with PyQu

Mohamed Almukhtar, Anwar Ghammam, Marouane Kessentini, Hua Ming

PDF

TL;DR

This study analyzes how specific code changes in Python ML systems impact software quality, introducing PyQu to identify quality-enhancing commits with high accuracy across a large dataset.

Contribution

The paper presents PyQu, a novel tool leveraging software metrics to detect quality-improving code changes in Python ML projects, revealing 41% previously unknown improvements.

Findings

01

PyQu achieves an average F1 score of 0.84 in identifying quality-enhancing commits.

02

Identified 61 code changes that directly improve software quality, classified into 13 categories.

03

41% of the detected quality improvements are new and not found by existing tools.

Abstract

In an era shaped by Generative Artificial Intelligence for code generation and the rising adoption of Python-based Machine Learning systems (MLS), software quality has emerged as a major concern. As these systems grow in complexity and importance, a key obstacle lies in understanding exactly how specific code changes affect overall quality-a shortfall aggravated by the lack of quality assessment tools and a clear mapping between ML systems code changes and their quality effects. Although prior work has explored code changes in MLS, it mostly stops at what the changes are, leaving a gap in our knowledge of the relationship between code changes and the MLS quality. To address this gap, we conducted a large-scale empirical study of 3,340 open-source Python ML projects, encompassing more than 3.7 million commits and 2.7 trillion lines of code. We introduce PyQu, a novel tool that leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.