Attention-based method for categorizing different types of online   harassment language

Christos Karatsalos; Yannis Panagiotakis

arXiv:1909.13104·cs.CL·April 20, 2020

Attention-based method for categorizing different types of online harassment language

Christos Karatsalos, Yannis Panagiotakis

PDF

TL;DR

This paper introduces a multi-attention RNN-based method for detecting various types of online harassment in tweets, addressing data imbalance with back-translation, and compares different RNN approaches.

Contribution

It proposes a novel multi-attention mechanism for harassment detection and tackles data imbalance, advancing NLP techniques for social media content moderation.

Findings

01

Effective harassment classification in tweets.

02

Improved handling of imbalanced datasets.

03

Comparison of RNN-based approaches.

Abstract

In the era of social media and networking platforms, Twitter has been doomed for abuse and harassment toward users specifically women. Monitoring the contents including sexism and sexual harassment in traditional media is easier than monitoring on the online social media platforms like Twitter, because of the large amount of user generated content in these media. So, the research about the automated detection of content containing sexual or racist harassment is an important issue and could be the basis for removing that content or flagging it for human evaluation. Previous studies have been focused on collecting data about sexism and racism in very broad terms. However, there is no much study focusing on different types of online harassment attracting natural language processing techniques. In this work, we present an multi-attention based approach for the detection of different types…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.