TL;DR
This paper explores the use of graph embedding techniques to detect abusive language in online conversations, aiming for language-independent moderation by automatically learning conversational graph representations.
Contribution
It introduces the application of recent graph embedding methods for abusive language detection and compares node and whole-graph embeddings on conversational data.
Findings
Certain embeddings capture specific topological features
Graph embeddings can be effective for language-independent moderation
Different approaches leverage different aspects of graph structure
Abstract
Abusive behaviors are common on online social networks. The increasing frequency of antisocial behaviors forces the hosts of online platforms to find new solutions to address this problem. Automating the moderation process has thus received a lot of interest in the past few years. Various methods have been proposed, most based on the exchanged content, and one relying on the structure and dynamics of the conversation. It has the advantage of being languageindependent, however it leverages a hand-crafted set of topological measures which are computationally expensive and not necessarily suitable to all situations. In the present paper, we propose to use recent graph embedding approaches to automatically learn representations of conversational graphs depicting message exchanges. We compare two categories: node vs. whole-graph embeddings. We experiment with a total of 8 approaches and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
