A Community-Centric Perspective for Characterizing and Detecting   Anti-Asian Violence-Provoking Speech

Gaurav Verma; Rynaa Grover; Jiawei Zhou; Binny Mathew; Jordan Kraemer,; Munmun De Choudhury; Srijan Kumar

arXiv:2407.15227·cs.CL·July 23, 2024

A Community-Centric Perspective for Characterizing and Detecting Anti-Asian Violence-Provoking Speech

Gaurav Verma, Rynaa Grover, Jiawei Zhou, Binny Mathew, Jordan Kraemer,, Munmun De Choudhury, Srijan Kumar

PDF

Open Access 1 Video

TL;DR

This study introduces a community-centric approach to characterizing and detecting anti-Asian violence-provoking speech on Twitter, highlighting the challenges of current classifiers and emphasizing the need for proactive interventions during crises.

Contribution

It develops a new codebook and dataset for violence-provoking speech, and compares NLP classifiers' effectiveness in detecting this harmful speech type.

Findings

01

Detection of violence-provoking speech is less accurate than hate speech ($F_1=0.69$ vs. 0.89).

02

Natural language classifiers struggle with reliably identifying violence-provoking content.

03

Community-based resources and tools are provided for large-scale detection.

Abstract

Violence-provoking speech -- speech that implicitly or explicitly promotes violence against the members of the targeted community, contributed to a massive surge in anti-Asian crimes during the pandemic. While previous works have characterized and built tools for detecting other forms of harmful speech, like fear speech and hate speech, our work takes a community-centric approach to studying anti-Asian violence-provoking speech. Using data from ~420k Twitter posts spanning a 3-year duration (January 1, 2020 to February 1, 2023), we develop a codebook to characterize anti-Asian violence-provoking speech and collect a community-crowdsourced dataset to facilitate its large-scale detection using state-of-the-art classifiers. We contrast the capabilities of natural language processing classifiers, ranging from BERT-based to LLM-based classifiers, in detecting violence-provoking speech with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Community-Centric Perspective for Characterizing and Detecting Anti-Asian Violence-Provoking Speech· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection