Measuring Spiritual Values and Bias of Large Language Models
Songyuan Liu, Ziyang Zhang, Runze Yan, Wei Wu, Carl Yang, Jiaying Lu

TL;DR
This paper investigates the spiritual values and biases present in large language models, revealing diverse spiritual perspectives and their influence on social fairness tasks, and proposes pre-training on spiritual texts to mitigate bias.
Contribution
It introduces a method to evaluate spiritual biases in LLMs and demonstrates how targeted pre-training can reduce such biases.
Findings
LLMs exhibit diverse spiritual values contrary to stereotypes.
Spiritual values influence LLMs' sensitivity in hate speech detection.
Pre-training on spiritual texts helps mitigate spiritual bias.
Abstract
Large language models (LLMs) have become integral tool for users from various backgrounds. LLMs, trained on vast corpora, reflect the linguistic and cultural nuances embedded in their pre-training data. However, the values and perspectives inherent in this data can influence the behavior of LLMs, leading to potential biases. As a result, the use of LLMs in contexts involving spiritual or moral values necessitates careful consideration of these underlying biases. Our work starts with verification of our hypothesis by testing the spiritual values of popular LLMs. Experimental results show that LLMs' spiritual values are quite diverse, as opposed to the stereotype of atheists or secularists. We then investigate how different spiritual values affect LLMs in social-fairness scenarios e.g., hate speech identification). Our findings reveal that different spiritual values indeed lead to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReligion, Spirituality, and Psychology
