User-Aware Multilingual Abusive Content Detection in Social Media
Mohammad Zia Ur Rehman, Somya Mehta, Kuldeep Singh, Kunal Kaushik,, Nagendra Kumar

TL;DR
This paper introduces a novel user-aware, multilingual abusive content detection method tailored for low-resource Indic languages, leveraging social context and user history to improve accuracy.
Contribution
It presents a new approach combining social and text features for abusive content detection in low-resource languages, outperforming existing methods.
Findings
Outperforms state-of-the-art methods with 4.08% and 9.52% F1-score improvements.
Effectively utilizes social context and user history for better detection.
Validated on large-scale multilingual datasets with significant accuracy gains.
Abstract
Despite growing efforts to halt distasteful content on social media, multilingualism has added a new dimension to this problem. The scarcity of resources makes the challenge even greater when it comes to low-resource languages. This work focuses on providing a novel method for abusive content detection in multiple low-resource Indic languages. Our observation indicates that a post's tendency to attract abusive comments, as well as features such as user history and social context, significantly aid in the detection of abusive content. The proposed method first learns social and text context features in two separate modules. The integrated representation from these modules is learned and used for the final prediction. To evaluate the performance of our method against different classical and state-of-the-art methods, we have performed extensive experiments on SCIDN and MACI datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
