TL;DR
This study evaluates 26 models for predicting the best answer in technical Q&A sites, demonstrating their effectiveness within Stack Overflow and across different platforms, with implications for improving knowledge curation.
Contribution
It provides a comprehensive assessment of best-answer prediction models and offers practical recommendations for Q&A platform design and knowledge management.
Findings
Model choice and parameter tuning significantly affect prediction accuracy.
Prediction models generalize well across different technical Q&A sites.
Automated tuning improves model performance.
Abstract
Technical Q&A sites have become essential for software engineers as they constantly seek help from other experts to solve their work problems. Despite their success, many questions remain unresolved, sometimes because the asker does not acknowledge any helpful answer. In these cases, an information seeker can only browse all the answers within a question thread to assess their quality as potential solutions. We approach this time-consuming problem as a binary-classification task where a best-answer prediction model is built to identify the accepted answer among those within a resolved question thread, and the candidate solutions to those questions that have received answers but are still unresolved. In this paper, we report on a study aimed at assessing 26 best-answer prediction models in two steps. First, we study how models perform when predicting best answers in Stack Overflow, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
