Towards Effective Counter-Responses: Aligning Human Preferences with   Strategies to Combat Online Trolling

Huije Lee; Hoyun Song; Jisu Shin; Sukmin Cho; SeungYoon Han; Jong C.; Park

arXiv:2410.04164·cs.CL·October 8, 2024

Towards Effective Counter-Responses: Aligning Human Preferences with Strategies to Combat Online Trolling

Huije Lee, Hoyun Song, Jisu Shin, Sukmin Cho, SeungYoon Han, Jong C., Park

PDF

Open Access 1 Repo

TL;DR

This paper explores how aligning human preferences with specific response strategies can effectively mitigate online trolling, promoting healthier online discussions and reducing negative impacts.

Contribution

It introduces a methodology for recommending appropriate counter-responses based on troll types, supported by a dataset linking strategies to human preferences.

Findings

01

Preferred response strategies vary with troll types

02

The proposed approach guides constructive discussions

03

It reduces the negative effects of trolling

Abstract

Trolling in online communities typically involves disruptive behaviors such as provoking anger and manipulating discussions, leading to a polarized atmosphere and emotional distress. Robust moderation is essential for mitigating these negative impacts and maintaining a healthy and constructive community atmosphere. However, effectively addressing trolls is difficult because their behaviors vary widely and require different response strategies (RSs) to counter them. This diversity makes it challenging to choose an appropriate RS for each specific situation. To address this challenge, our research investigates whether humans have preferred strategies tailored to different types of trolling behaviors. Our findings reveal a correlation between the types of trolling encountered and the preferred RS. In this paper, we introduce a methodology for generating counter-responses to trolls by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

huijelee/ELF-HumanPreference
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Cybercrime and Law Enforcement Studies · Spam and Phishing Detection