Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude
Wentao Xu, Yile Yan, Yuqi Zhu

TL;DR
This study systematically evaluates biases in nine large language models' responses to ethical dilemmas, revealing significant protected attribute biases and differences based on model type and dilemma context, emphasizing the need for nuanced fairness assessments.
Contribution
It provides a comprehensive, systematic comparison of ethical biases across multiple LLMs in various dilemma scenarios, highlighting the influence of model type and input complexity.
Findings
All models exhibit significant biases related to protected attributes.
Open-source LLMs show stronger biases towards marginalized groups.
Biases are more pronounced in intersectional attribute scenarios.
Abstract
Recent advances in Large Language Models (LLMs) have enabled human-like responses across various tasks, raising questions about their ethical decision-making capabilities and potential biases. This study systematically evaluates how nine popular LLMs (both open-source and closed-source) respond to ethical dilemmas involving protected attributes. Across 50,400 trials spanning single and intersectional attribute combinations in four dilemma scenarios (protective vs. harmful), we assess models' ethical preferences, sensitivity, stability, and clustering patterns. Results reveal significant biases in protected attributes in all models, with differing preferences depending on model type and dilemma context. Notably, open-source LLMs show stronger preferences for marginalized groups and greater sensitivity in harmful scenarios, while closed-source models are more selective in protective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Weight Decay · Multi-Head Attention · Layer Normalization · Dense Connections · Cosine Annealing
