An Empirical Study of the Evolution of GitHub Actions Workflows
Pooya Rostami Mazrae, Alexandre Decan, Tom Mens, Mairieli Wessel

TL;DR
This study analyzes the evolution of GitHub Actions workflows over time, revealing patterns in changes, their nature, and the lack of impact from recent technological shifts, emphasizing the need for better tooling.
Contribution
It provides the first large-scale empirical analysis of GitHub Actions workflow changes, categorizing types of modifications and quantifying their frequency and scope.
Findings
Most workflows are small with few changes per update.
Workflow modifications mainly involve task configuration and specification.
No clear impact of LLM tools on workflow maintenance frequency.
Abstract
CI/CD practices play a significant role during collaborative software development by automating time-consuming and repetitive tasks such as testing, building, quality checking, dependency and security management. GitHub Actions, the CI/CD tool integrated into GitHub, allows repository maintainers to automate development workflows. We conducted a mixed methods analysis of GitHub Actions workflow changes over time. Through a preliminary qualitative analysis of 439 modified workflow files we identified seven types of conceptual changes to workflows. Next, we performed a quantitative analysis over 49K+ GitHub repositories totaling 267K+ workflow change histories and 3.4M+ workflow file versions from November 2019 to August 2025. This analysis revealed that repositories contain a median of three workflow files, and 7.3% of all workflow files are being changed every week. The changes made to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Software Engineering Research · Software System Performance and Reliability
