Linking Code and Documentation Churn: Preliminary Analysis
Ani Hovhannisyan, Youmei Fan, Gema Rodriguez-Perez, Raula Gaikovina, Kula

TL;DR
This paper explores the relationship between code changes and documentation updates in open-source projects, highlighting how synchronizing them can improve maintainability and proposing AI tools to assist in this process.
Contribution
It introduces a novel analysis linking code churn with documentation updates, emphasizing the potential of AI to automate and improve synchronization.
Findings
Varying degrees of synchrony across projects
Importance of integrated documentation practices
Potential of AI and LLMs for auto-generating documentation
Abstract
Code churn refers to the measure of the amount of code added, modified, or deleted in a project and is often used to assess codebase stability and maintainability. Program comprehension or how understandable the changes are, is equally important for maintainability. Documentation is crucial for knowledge transfer, especially when new maintainers take over abandoned code. We emphasize the need for corresponding documentation updates, as this reflects project health and trustworthiness as a third-party library. Therefore, we argue that every code change should prompt a documentation update (defined as documentation churn). Linking code churn changes with documentation updates is important for project sustainability, as it facilitates knowledge transfer and reduces the effort required for program comprehension. This study investigates the synchrony between code churn and documentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis
