HEARTS: A Holistic Framework for Explainable, Sustainable and Robust Text Stereotype Detection
Theo King, Zekun Wu, Adriano Koshiyama, Emre Kazim, Philip Treleaven

TL;DR
HEARTS is a comprehensive framework that improves stereotype detection in text by combining high performance, explainability, and sustainability, using a new diverse dataset and interpretability techniques.
Contribution
The paper introduces HEARTS, a novel holistic framework for stereotype detection that integrates a new dataset, model performance, explainability, and environmental sustainability.
Findings
Fine-tuned BERT models outperform individual components.
ALBERT-V2 with SHAP provides human-aligned explanations.
HEARTS reduces carbon footprint while maintaining accuracy.
Abstract
Stereotypes are generalised assumptions about societal groups, and even state-of-the-art LLMs using in-context learning struggle to identify them accurately. Due to the subjective nature of stereotypes, where what constitutes a stereotype can vary widely depending on cultural, social, and individual perspectives, robust explainability is crucial. Explainable models ensure that these nuanced judgments can be understood and validated by human users, promoting trust and accountability. We address these challenges by introducing HEARTS (Holistic Framework for Explainable, Sustainable, and Robust Text Stereotype Detection), a framework that enhances model performance, minimises carbon footprint, and provides transparent, interpretable explanations. We establish the Expanded Multi-Grain Stereotype Dataset (EMGSD), comprising 57,201 labelled texts across six groups, including under-represented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Softmax · Layer Normalization · Dropout · Attention Dropout · WordPiece · Dense Connections · Residual Connection · Linear Layer
