Generating GitHub Repository Descriptions: A Comparison of Manual and Automated Approaches
Jazlyn Hellman, Eunbee Jang, Christoph Treude, Chenzhun Huang, Jin, L.C. Guo

TL;DR
This paper evaluates manual and automated methods for generating GitHub repository descriptions, proposing the LSP template for clearer descriptions and demonstrating automated summarization's effectiveness through user studies.
Contribution
It introduces the LSP template for better repository descriptions and compares automated summarization techniques, highlighting their practical utility for improving GitHub project communication.
Findings
Automated summarization can generate adequate default descriptions.
The LSP template improves clarity and informativeness of repository descriptions.
User studies favor LSP-based descriptions for effective communication.
Abstract
Given the vast number of repositories hosted on GitHub, project discovery and retrieval have become increasingly important for GitHub users. Repository descriptions serve as one of the first points of contact for users who are accessing a repository. However, repository owners often fail to provide a high-quality description; instead, they use vague terms, the purpose of the repository is poorly explained, or the description is omitted entirely. In this work, we examine the current practice of writing GitHub repository descriptions. Our investigation leads to the proposal of the LSP (Language, Software technology, and Purpose) template to formulate good descriptions for GitHub repositories that are clear, concise, and informative. To understand the extent to which current automated techniques can support generating repository descriptions, we compare the performance of state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Web Data Mining and Analysis
