Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective
Svetlana Kiritchenko, Isar Nejadgholi, Kathleen C. Fraser

TL;DR
This survey reviews NLP approaches to online abusive language detection, emphasizing ethical and human rights considerations to ensure technology benefits society without causing harm or discrimination.
Contribution
It introduces an ethical framework based on eight principles to evaluate and guide NLP research and applications in online abuse detection.
Findings
Current NLP models achieve high accuracy but risk unintended harms.
Ethical principles are essential for responsible deployment of abuse detection tech.
Opportunities include rights-respecting solutions like nudging and value-sensitive design.
Abstract
The pervasiveness of abusive content on the internet can lead to severe psychological and physical harm. Significant effort in Natural Language Processing (NLP) research has been devoted to addressing this problem through abusive content detection and related sub-areas, such as the detection of hate speech, toxicity, cyberbullying, etc. Although current technologies achieve high classification performance in research studies, it has been observed that the real-life application of this technology can cause unintended harms, such as the silencing of under-represented groups. We review a large body of NLP research on automatic abuse detection with a new focus on ethical challenges, organized around eight established ethical principles: privacy, accountability, safety and security, transparency and explainability, fairness and non-discrimination, human control of technology, professional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
