Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki, Timothy Baldwin

TL;DR
This paper investigates how Chain-of-Thought prompting influences gender bias in large language models during unscalable tasks, finding that CoT reduces social bias and promotes fairer predictions.
Contribution
It introduces a benchmark for gender bias in counting tasks and demonstrates that CoT prompting can mitigate implicit gender biases in LLMs.
Findings
CoT prompting reduces gender bias in LLM predictions.
Most LLMs exhibit social bias without CoT.
Explicit step-by-step reasoning encourages fairer outcomes.
Abstract
There exist both scalable tasks, like reading comprehension and fact-checking, where model performance improves with model size, and unscalable tasks, like arithmetic reasoning and symbolic reasoning, where model performance does not necessarily improve with model size. Large language models (LLMs) equipped with Chain-of-Thought (CoT) prompting are able to make accurate incremental predictions even on unscalable tasks. Unfortunately, despite their exceptional reasoning abilities, LLMs tend to internalize and reproduce discriminatory societal biases. Whether CoT can provide discriminatory or egalitarian rationalizations for the implicit information in unscalable tasks remains an open question. In this study, we examine the impact of LLMs' step-by-step predictions on gender bias in unscalable tasks. For this purpose, we construct a benchmark for an unscalable task where the LLM is given…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Hate Speech and Cyberbullying Detection
MethodsChain-of-thought prompting
