Understanding Counterspeech for Online Harm Mitigation
Yi-Ling Chung, Gavin Abercrombie, Florence Enock, Jonathan Bright,, Verena Rieser

TL;DR
This paper reviews counterspeech strategies for online hate mitigation, analyzing social science and computer science research to identify effective methods and future research directions.
Contribution
It provides a systematic review of counterspeech research across disciplines, highlighting gaps and proposing future directions for automated counterspeech generation.
Findings
Certain types of counterspeech are more effective in reducing hate.
Automated counterspeech can scale online hate mitigation efforts.
Interdisciplinary approaches enhance understanding of counterspeech efficacy.
Abstract
Counterspeech offers direct rebuttals to hateful speech by challenging perpetrators of hate and showing support to targets of abuse. It provides a promising alternative to more contentious measures, such as content moderation and deplatforming, by contributing a greater amount of positive online speech rather than attempting to mitigate harmful content through removal. Advances in the development of large language models mean that the process of producing counterspeech could be made more efficient by automating its generation, which would enable large-scale online campaigns. However, we currently lack a systematic understanding of several important factors relating to the efficacy of counterspeech for hate mitigation, such as which types of counterspeech are most effective, what are the optimal conditions for implementation, and which specific effects of hate it can best ameliorate.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Social Media and Politics
