Source Code Hotspots: A Diagnostic Method for Quality Issues
Saleha Muzammil, Mughees Ur Rehman, Zoe Kotti, Diomidis Spinellis

TL;DR
This paper identifies common patterns of frequent code changes, called hotspots, in GitHub projects, and offers guidelines to reduce them, thereby improving software quality and maintainability.
Contribution
It introduces a taxonomy of 15 recurring hotspot patterns, links them to refactoring and CI checks, and highlights the role of automated accounts in hotspot creation.
Findings
74% of hotspot edits are generated by automated accounts
Pinched Version Bump is the most common hotspot pattern
Mapping patterns to guidelines helps reduce hotspots
Abstract
Software source code often harbours "hotspots": small portions of the code that change far more often than the rest of the project and thus concentrate maintenance activity. We mine the complete version histories of 91 evolving, actively developed GitHub repositories and identify 15 recurring line-level hotspot patterns that explain why these hotspots emerge. The three most prevalent patterns are Pinned Version Bump (26%), revealing brittle release practices; Long Line Change (17%), signalling deficient layout; and Formatting Ping-Pong (9%), indicating missing or inconsistent style automation. Surprisingly, automated accounts generate 74% of all hotspot edits, suggesting that bot activity is a dominant but largely avoidable source of noise in change histories. By mapping each pattern to concrete refactoring guidelines and continuous integration checks, our taxonomy equips practitioners…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software System Performance and Reliability
