Fair and Argumentative Language Modeling for Computational Argumentation
Carolin Holtermann, Anne Lauscher, Simone Paolo Ponzetto

TL;DR
This paper investigates bias in computational argumentation language models, introduces a new bias measurement resource, and demonstrates effective, sustainable debiasing methods that preserve or enhance argument quality prediction performance.
Contribution
It introduces ABBA, a novel bias measurement resource for argumentation, and evaluates lightweight debiasing techniques that effectively reduce bias while maintaining or improving downstream task performance.
Findings
Bias can be effectively reduced in argumentative language models.
Debiasing methods can improve downstream argument quality prediction.
Sustainable, parameter-efficient debiasing is feasible.
Abstract
Although much work in NLP has focused on measuring and mitigating stereotypical bias in semantic spaces, research addressing bias in computational argumentation is still in its infancy. In this paper, we address this research gap and conduct a thorough investigation of bias in argumentative language models. To this end, we introduce ABBA, a novel resource for bias measurement specifically tailored to argumentation. We employ our resource to assess the effect of argumentative fine-tuning and debiasing on the intrinsic bias found in transformer-based language models using a lightweight adapter-based approach that is more sustainable and parameter-efficient than full fine-tuning. Finally, we analyze the potential impact of language model debiasing on the performance in argument quality prediction, a downstream task of computational argumentation. Our results show that we are able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Hate Speech and Cyberbullying Detection
