DeTox-Fed: Detecting Toxic Conversations in the Fediverse with Federated Graph Neural Networks
Pantelitsa Leonidou, Nikos Salamanos, Sotiris Gypsiotis, Michael Sirivianos

TL;DR
DeTox-Fed introduces a federated graph neural network framework for detecting toxic conversations in decentralized social networks, leveraging conversation structures and user interactions without sharing raw data.
Contribution
The paper presents a novel federated graph learning approach for toxicity detection in DSNs that preserves data privacy and combines conversational and user interaction features.
Findings
Achieves stable toxic conversation detection with limited local labels.
Performs well under partial client participation and varying moderation thresholds.
Demonstrates effectiveness on a large Pleroma dataset.
Abstract
The rise of decentralized social networks (DSNs), and in particular the rapid uptake of the Fediverse (e.g., Pleroma, Mastodon, Lemygrad), introduces new challenges in content moderation. Independent instances host their own data, follow different moderation policies, and often observe only partial views of conversations. We present DeTox-Fed, a federated graph-learning framework for detecting toxic conversations in DSNs without requiring instances to share raw conversations or moderation labels. Each instance constructs a local conversation graph, where nodes represent conversation trees and edges capture shared user participation across conversations. A Graph Neural Network (GNN) is then trained in a federated learning setup, allowing instances to collaboratively learn a toxicity classifier while preserving data locality. Unlike text-only moderation approaches, DeTox-Fed combines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
