Features that Predict the Acceptability of Java and JavaScript Answers on Stack Overflow
Osayande P. Omondiagbe, Sherlock A. Licorish, Stephen G. MacDonell

TL;DR
This study identifies key features such as code length, user reputation, text similarity, and response time that predict whether an answer on Stack Overflow will be accepted, aiding in better answer selection.
Contribution
It introduces a predictive model using machine learning to distinguish accepted answers from unaccepted ones based on answer characteristics.
Findings
Code length and user reputation are strong predictors.
Text similarity between question and answer influences acceptance.
Time lag between question posting and answer impacts acceptance likelihood.
Abstract
Context: Stack Overflow is a popular community question and answer portal used by practitioners to solve problems during software development. Developers can focus their attention on answers that have been accepted or where members have recorded high votes in judging good answers when searching for help. However, the latter mechanism (votes) can be unreliable, and there is currently no way to differentiate between an answer that is likely to be accepted and those that will not be accepted by looking at the answer's characteristics. Objective: In potentially providing a mechanism to identify acceptable answers, this study examines the features that distinguish an accepted answer from an unaccepted answer. Methods: We studied the Stack Overflow dataset by analyzing questions and answers for the two most popular tags (Java and JavaScript). Our dataset comprised 249,588 posts drawn from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
