Toxicity Ahead: Forecasting Conversational Derailment on GitHub

Mia Mohammad Imran; Robert Zita; Rahat Rizvi Rahman; Preetha Chatterjee; Kostadin Damevski

arXiv:2512.15031·cs.SE·December 18, 2025

Toxicity Ahead: Forecasting Conversational Derailment on GitHub

Mia Mohammad Imran, Robert Zita, Rahat Rizvi Rahman, Preetha Chatterjee, Kostadin Damevski

PDF

Open Access

TL;DR

This paper introduces a novel LLM-based framework that predicts conversational derailment in GitHub discussions by analyzing conversation dynamics, achieving high accuracy and outperforming existing NLP methods for proactive moderation.

Contribution

It presents a new two-step LLM prompting approach to forecast toxicity in OSS discussions, with a curated dataset and validation showing superior performance.

Findings

01

The framework achieves F1-scores of 0.901 and 0.852 on test models.

02

Structured LLM prompting improves early detection of toxicity.

03

External validation confirms robustness with F1 up to 0.797.

Abstract

Toxic interactions in Open Source Software (OSS) communities reduce contributor engagement and threaten project sustainability. Preventing such toxicity before it emerges requires a clear understanding of how harmful conversations unfold. However, most proactive moderation strategies are manual, requiring significant time and effort from community maintainers. To support more scalable approaches, we curate a dataset of 159 derailed toxic threads and 207 non-toxic threads from GitHub discussions. Our analysis reveals that toxicity can be forecast by tension triggers, sentiment shifts, and specific conversational patterns. We present a novel Large Language Model (LLM)-based framework for predicting conversational derailment on GitHub using a two-step prompting pipeline. First, we generate \textit{Summaries of Conversation Dynamics} (SCDs) via Least-to-Most (LtM) prompting; then we use…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Software Engineering Research · Topic Modeling