Predictive Analytics for Collaborators Answers, Code Quality, and Dropout on Stack Overflow
Elijah Zolduoarrati, Sherlock A. Licorish, Nigel Stanger

TL;DR
This study benchmarks 21 machine learning algorithms across three tasks on Stack Overflow data, providing insights into the most effective models and hyperparameters for predicting user activity, code quality, and dropout.
Contribution
It offers a comprehensive comparison of multiple models and hyperparameter optimization techniques for three key Stack Overflow user prediction tasks, which was lacking in prior research.
Findings
Bagging with standardisation best predicts user answers (R2=0.821)
Gradient boosting and SVM outperform others in code quality prediction
Extreme Gradient Boosting and CodeBERT excel in dropout classification
Abstract
Previous studies that used data from Stack Overflow to develop predictive models often employed limited benchmarks of 3-5 models or adopted arbitrary selection methods. Despite being insightful, their limited scope suggests the need to benchmark more models to avoid overlooking untested algorithms. Our study evaluates 21 algorithms across three tasks: predicting the number of question a user is likely to answer, their code quality violations, and their dropout status. We employed normalisation, standardisation, as well as logarithmic and power transformations paired with Bayesian hyperparameter optimisation and genetic algorithms. CodeBERT, a pre-trained language model for both natural and programming languages, was fine-tuned to classify user dropout given their posts (questions and answers) and code snippets. We found Bagging ensemble models combined with standardisation achieved the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
