GPT-4's One-Dimensional Mapping of Morality: How the Accuracy of Country-Estimates Depends on Moral Domain
Pontus Strimling, Joel Krueger, Simon Karlsson

TL;DR
This study examines how GPT-4 predicts moral opinions across countries, revealing it relies mainly on a single dimension and performs better on personal-sexual issues than on violent-dishonest ones, especially in low-income countries.
Contribution
It advances understanding of GPT-4's moral prediction by analyzing its reliance on a single moral dimension and how accuracy varies across moral domains and income levels.
Findings
GPT-4 predicts high-income countries' moral opinions more accurately.
GPT-4's predictions are more accurate in the personal-sexual domain.
Predictive accuracy drops significantly in the violent-dishonest domain.
Abstract
Prior research demonstrates that Open AI's GPT models can predict variations in moral opinions between countries but that the accuracy tends to be substantially higher among high-income countries compared to low-income ones. This study aims to replicate previous findings and advance the research by examining how accuracy varies with different types of moral questions. Using responses from the World Value Survey and the European Value Study, covering 18 moral issues across 63 countries, we calculated country-level mean scores for each moral issue and compared them with GPT-4's predictions. Confirming previous findings, our results show that GPT-4 has greater predictive success in high-income than in low-income countries. However, our factor analysis reveals that GPT-4 bases its predictions primarily on a single dimension, presumably reflecting countries' degree of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
