From Issues to Insights: RAG-based Explanation Generation from Software Engineering Artifacts
Daniel P\"ottgen, Mersedeh Sadeghi, Max Unterbusch, Andreas Vogelsang

TL;DR
This paper introduces a novel RAG-based method for generating software explanations from issue-tracking data, demonstrating high alignment with human explanations and enhancing software transparency.
Contribution
It is the first to apply RAG techniques to generate explanations from issue-tracking systems, showcasing feasibility and effectiveness.
Findings
Achieved 90% alignment with human explanations
Demonstrated strong faithfulness and instruction adherence
Proved RAG's potential for software explainability
Abstract
The increasing complexity of modern software systems has made understanding their behavior increasingly challenging, driving the need for explainability to improve transparency and user trust. Traditional documentation is often outdated or incomplete, making it difficult to derive accurate, context-specific explanations. Meanwhile, issue-tracking systems capture rich and continuously updated development knowledge, but their potential for explainability remains untapped. With this work, we are the first to apply a Retrieval-Augmented Generation (RAG) approach for generating explanations from issue-tracking data. Our proof-of-concept system is implemented using open-source tools and language models, demonstrating the feasibility of leveraging structured issue data for explanation generation. Evaluating our approach on an exemplary project's set of GitHub issues, we achieve 90% alignment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Scientific Computing and Data Management · Software Engineering Research
