Long-Range Correlation in Code Commit Dynamics as a Novel Indicator of Software Product Stability: A Detrended Fluctuation Analysis Study
Goran Mitevski

TL;DR
This study introduces a fractal scaling exponent derived from commit activity time series as a new indicator of software stability, revealing that long-range correlations in commits relate to product robustness.
Contribution
It demonstrates that the fractal scaling exponent alpha, estimated via DFA on commit data, effectively distinguishes stable from unstable software development periods.
Findings
Stable periods show higher alpha values indicating stronger long-range correlations.
Unstable periods have more commits but lower long-range memory, indicating volume alone isn't predictive.
Results are validated against surrogate data and are robust across parameter settings.
Abstract
This work proposes the fractal scaling exponent alpha, estimated via Detrended Fluctuation Analysis (DFA) on the unaggregated time series of lines of code added per commit event in a software repository, as a novel process-level indicator of software product stability. The proposal rests on the hypothesis that stable software products arise from development processes characterised by long-range temporal correlations in commit behaviour: each code addition is shaped not only by the immediately preceding commits but by patterns extending weeks or months into the past and anticipating work to be done in the future. This hypothesis is tested on two non-overlapping 712-day time series of lines of code added per commit event, drawn from a closed-source software organisation and labeled as stable and unstable by the lead engineer on the basis of crash-analytics data. Applied to these series,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
