Tackling Bias in Pre-trained Language Models: Current Trends and   Under-represented Societies

Vithya Yogarajan; Gillian Dobbie; Te Taka Keegan; Rostam J. Neuwirth

arXiv:2312.01509·cs.CY·December 5, 2023·5 cites

Tackling Bias in Pre-trained Language Models: Current Trends and Under-represented Societies

Vithya Yogarajan, Gillian Dobbie, Te Taka Keegan, Rostam J. Neuwirth

PDF

Open Access

TL;DR

This paper surveys current methods for identifying and mitigating bias in large language models, emphasizing the unique needs of under-represented societies and highlighting the importance of tailored approaches.

Contribution

It provides a comprehensive overview of bias mitigation techniques and explores the perspective of under-represented societies, proposing the need for adapted solutions.

Findings

01

Current bias mitigation methods are insufficient for under-represented societies

02

Existing techniques need adaptation to address societal diversity

03

Case studies from New Zealand illustrate tailored requirements

Abstract

The benefits and capabilities of pre-trained language models (LLMs) in current and future innovations are vital to any society. However, introducing and using LLMs comes with biases and discrimination, resulting in concerns about equality, diversity and fairness, and must be addressed. While understanding and acknowledging bias in LLMs and developing mitigation strategies are crucial, the generalised assumptions towards societal needs can result in disadvantages towards under-represented societies and indigenous populations. Furthermore, the ongoing changes to actual and proposed amendments to regulations and laws worldwide also impact research capabilities in tackling the bias problem. This research presents a comprehensive survey synthesising the current trends and limitations in techniques used for identifying and mitigating bias in LLMs, where the overview of methods for tackling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Interpreting and Communication in Healthcare · Natural Language Processing Techniques