How Compliant Are GitHub Actions Workflows? A Checklist-Based Study with LLM-Assisted Auditing
Edward Abrokwah, Taher A. Ghaleb

TL;DR
This study evaluates GitHub Actions workflows for compliance with best practices using a new checklist and LLMs, revealing significant security and maintainability gaps and proposing a hybrid auditing framework.
Contribution
It introduces a comprehensive GHA compliance checklist and a multi-tier LLM-based auditing framework, providing empirical benchmarks and insights into automated security assessments.
Findings
Overall compliance is only 28%
Permission controls compliance is just 4%
LLMs reduce verification effort by 81% but need human oversight
Abstract
GitHub Actions (GHA) CI workflows are critical infrastructure, but current tooling offers only syntactic or heuristic checks and does not enforce documented best practices for security, maintainability, or performance. Consequently, issues like over-privileged permissions, weak secrets management, and missing failure notifications remain undetected in real-world pipelines. This paper proposes a novel, documentation-grounded GHA compliance checklist with 30 criteria spanning four workflow sections and eight themes, and assesses Large Language Models (LLMs) for scalable compliance auditing. On 95 real-world Java workflows (2,850 assessments) using four open-weight LLMs, we find only fair agreement (Fleiss' kappa = 0.28), with systematic disagreement on structural reasoning and security-sensitive judgments. To address this, we introduce a multi-tier adjudication framework in which GPT 5…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
