Characterizing Linguistic Attributes for Automatic Classification of   Intent Based Racist/Radicalized Posts on Tumblr Micro-Blogging Website

Swati Agarwal; Ashish Sureka

arXiv:1701.04931·cs.IR·January 19, 2017·48 cites

Characterizing Linguistic Attributes for Automatic Classification of Intent Based Racist/Radicalized Posts on Tumblr Micro-Blogging Website

Swati Agarwal, Ashish Sureka

PDF

Open Access

TL;DR

This paper develops an ensemble learning classifier to detect racist or radicalized intent in Tumblr posts by analyzing semantic, sentiment, and linguistic features, addressing ambiguity beyond keyword spotting.

Contribution

It introduces a novel approach combining semantic, sentiment, and linguistic features with ensemble learning to identify racist intent in social media posts.

Findings

01

Effective discrimination of racist posts using emotion tone and social tendencies.

02

Linguistic cues and personality traits are key features for intent classification.

03

Proposed method outperforms baseline keyword-based techniques.

Abstract

Research shows that many like-minded people use popular microblogging websites for posting hateful speech against various religions and race. Automatic identification of racist and hate promoting posts is required for building social media intelligence and security informatics based solutions. However, just keyword spotting based techniques cannot be used to accurately identify the intent of a post. In this paper, we address the challenge of the presence of ambiguity in such posts by identifying the intent of author. We conduct our study on Tumblr microblogging website and develop a cascaded ensemble learning classifier for identifying the posts having racist or radicalized intent. We train our model by identifying various semantic, sentiment and linguistic features from free-form text. Our experimental results shows that the proposed approach is effective and the emotion tone, social…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Social Media and Politics · Spam and Phishing Detection