Evaluating Gender Bias in Large Language Models via Chain-of-Thought   Prompting

Masahiro Kaneko; Danushka Bollegala; Naoaki Okazaki; Timothy Baldwin

arXiv:2401.15585·cs.CL·January 30, 2024·6 cites

Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting

Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki, Timothy Baldwin

PDF

Open Access

TL;DR

This paper investigates how Chain-of-Thought prompting influences gender bias in large language models during unscalable tasks, finding that CoT reduces social bias and promotes fairer predictions.

Contribution

It introduces a benchmark for gender bias in counting tasks and demonstrates that CoT prompting can mitigate implicit gender biases in LLMs.

Findings

01

CoT prompting reduces gender bias in LLM predictions.

02

Most LLMs exhibit social bias without CoT.

03

Explicit step-by-step reasoning encourages fairer outcomes.

Abstract

There exist both scalable tasks, like reading comprehension and fact-checking, where model performance improves with model size, and unscalable tasks, like arithmetic reasoning and symbolic reasoning, where model performance does not necessarily improve with model size. Large language models (LLMs) equipped with Chain-of-Thought (CoT) prompting are able to make accurate incremental predictions even on unscalable tasks. Unfortunately, despite their exceptional reasoning abilities, LLMs tend to internalize and reproduce discriminatory societal biases. Whether CoT can provide discriminatory or egalitarian rationalizations for the implicit information in unscalable tasks remains an open question. In this study, we examine the impact of LLMs' step-by-step predictions on gender bias in unscalable tasks. For this purpose, we construct a benchmark for an unscalable task where the LLM is given…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Hate Speech and Cyberbullying Detection

MethodsChain-of-thought prompting