Walking in Others' Shoes: How Perspective-Taking Guides Large Language   Models in Reducing Toxicity and Bias

Rongwu Xu; Zi'an Zhou; Tianwei Zhang; Zehan Qi; Su Yao; Ke Xu; Wei Xu,; Han Qiu

arXiv:2407.15366·cs.CL·July 23, 2024

Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias

Rongwu Xu, Zi'an Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu,, Han Qiu

PDF

Open Access 1 Video

TL;DR

This paper introduces perspective-taking prompting (PeT), a novel method inspired by social psychology, that effectively reduces toxicity and bias in large language models without requiring model access or extensive retraining.

Contribution

The paper presents PeT, a new prompting strategy that enables LLMs to self-regulate responses by integrating diverse perspectives, significantly decreasing harmful content.

Findings

01

PeT reduces toxicity by up to 89%.

02

PeT decreases bias by up to 73%.

03

PeT outperforms five baseline methods across multiple LLMs.

Abstract

The common toxicity and societal bias in contents generated by large language models (LLMs) necessitate strategies to reduce harm. Present solutions often demand white-box access to the model or substantial training, which is impractical for cutting-edge commercial LLMs. Moreover, prevailing prompting methods depend on external tool feedback and fail to simultaneously lessen toxicity and bias. Motivated by social psychology principles, we propose a novel strategy named \textbf{perspective-taking prompting (\textsc{PeT})} that inspires LLMs to integrate diverse human perspectives and self-regulate their responses. This self-correction mechanism can significantly diminish toxicity (up to $89%$ ) and bias (up to $73%$ ) in LLMs' responses. Rigorous evaluations and ablation studies are conducted on two commercial LLMs (ChatGPT and GLM) and three open-source LLMs, revealing \textsc{PeT}'s…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias· underline

Taxonomy

TopicsComputational Drug Discovery Methods