Incivility Detection in Open Source Code Review and Issue Discussions
Isabella Ferreira, Ahlaam Rafiq, Jinghui Cheng

TL;DR
This study compares six classical machine learning models with BERT for detecting incivility in open source code review and issue discussions. BERT outperforms the classical models, adding contextual information does not improve its performance, and accuracy in a cross-platform setting remains a challenge.
Contribution
The paper evaluates BERT against classical machine learning models for incivility detection, and assesses whether adding contextual information helps and how well the models perform in a cross-platform setting.
Findings
BERT achieves the highest F1-score, 0.95.
Classical models underperform in detecting civil discussions.
Adding context does not improve BERT's performance.
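For readers unfamiliar with the metric behind these findings, the following minimal sketch shows how a per-class F1-score is computed as the harmonic mean of precision and recall. The function name `f1_score`, the label strings, and the toy labels are assumptions made for illustration; they are not data or code from the paper, and the paper's exact evaluation setup may differ.

```python
from typing import Sequence


def f1_score(y_true: Sequence[str], y_pred: Sequence[str], positive: str) -> float:
    """Return the F1-score for one class (harmonic mean of precision and recall)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # true positives
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false positives
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # false negatives
    if tp == 0:
        # No correct positive predictions: F1 is 0 by convention.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels, invented for illustration only (not results from the paper).
y_true = ["uncivil", "civil", "uncivil", "civil", "uncivil", "civil"]
y_pred = ["uncivil", "uncivil", "uncivil", "civil", "civil", "civil"]

uncivil_f1 = f1_score(y_true, y_pred, "uncivil")
civil_f1 = f1_score(y_true, y_pred, "civil")
print(f"uncivil F1: {uncivil_f1:.2f}, civil F1: {civil_f1:.2f}")
```

Computing the score separately for each class makes it visible when a model handles one class (for example, civil discussions) worse than the other, which a single aggregate number can hide.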
Abstract
Given the democratic nature of open source development, code review and issue discussions may be uncivil. Incivility, defined as features of discussion that convey an unnecessarily disrespectful tone, can have negative consequences to open source communities. To prevent or minimize these negative consequences, open source platforms have included mechanisms for removing uncivil language from the discussions. However, such approaches require manual inspection, which can be overwhelming given the large number of discussions. To help open source communities deal with this problem, in this paper, we aim to compare six classical machine learning models with BERT to detect incivility in open source code review and issue discussions. Furthermore, we assess if adding contextual information improves the models' performance and how well the models perform in a cross-platform setting. We found that…
Taxonomy
Topics: Hate Speech and Cyberbullying Detection · Software Engineering Research · Open Source Software Innovations
