Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype Detection
Rajkumar Pujari, Erik Oveson, Priyanka Kulkarni, Elnaz Nouri

TL;DR
This paper introduces a multi-task learning framework guided by reinforcement learning to improve stereotype detection in text, utilizing auxiliary tasks and addressing dataset reliability issues for better performance.
Contribution
It presents a novel reinforcement-guided multi-task learning approach that leverages related tasks to enhance stereotype detection accuracy.
Findings
Significant empirical improvements over baselines.
Effective use of neighboring tasks for stereotype detection.
Reinforcement learning guides example selection for better training.
Abstract
As large Pre-trained Language Models (PLMs) trained on large amounts of data in an unsupervised manner become more ubiquitous, identifying various types of bias in the text has come into sharp focus. Existing "Stereotype Detection" datasets mainly adopt a diagnostic approach toward large PLMs. Blodgett et. al (2021a) show that there are significant reliability issues with the existing benchmark datasets. Annotating a reliable dataset requires a precise understanding of the subtle nuances of how stereotypes manifest in text. In this paper, we annotate a focused evaluation set for "Stereotype Detection" that addresses those pitfalls by de-constructing various ways in which stereotypes manifest in text. Further, we present a multi-task model that leverages the abundance of data-rich neighboring tasks such as hate speech detection, offensive language detection, misogyny detection, etc., to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
