Reproducibility of Issues Reported in Stack Overflow Questions: Challenges, Impact & Estimation
Saikat Mondal, Banani Roy

TL;DR
This paper investigates the challenges of reproducing issues in Stack Overflow questions, validates these challenges with practitioners, and develops ML models to predict reproducibility, aiming to improve question quality and response effectiveness.
Contribution
It provides a practitioner-validated catalog of reproducibility challenges, analyzes their impact, and introduces ML models with high accuracy for early issue reproducibility prediction.
Findings
90% of practitioners agree with identified challenges
Missing important code parts severely impacts reproducibility
ML models achieve over 82% accuracy in predicting reproducibility
Abstract
Software developers often submit questions to technical Q&A sites like Stack Overflow (SO) to resolve code-level problems. In practice, they include example code snippets with questions to explain the programming issues. Existing research suggests that users attempt to reproduce the reported issues using given code snippets when answering questions. Unfortunately, such code snippets could not always reproduce the issues due to several unmet challenges that prevent questions from receiving appropriate and prompt solutions. One previous study investigated reproducibility challenges and produced a catalog. However, how the practitioners perceive this challenge catalog is unknown. Practitioners' perspectives are inevitable in validating these challenges and estimating their severity. This study first surveyed 53 practitioners to understand their perspectives on reproducibility challenges.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExpert finding and Q&A systems · Advanced Text Analysis Techniques · Sentiment Analysis and Opinion Mining
