Identifying the sources of ideological bias in GPT models through linguistic variation in output
Christina Walker, Joan C. Timoneda

TL;DR
This paper investigates ideological biases in GPT models by analyzing linguistic variations across different languages, revealing biases linked to training data and filtering policies, and emphasizing the need for curated datasets.
Contribution
It introduces a novel method to detect ideological bias in GPT models through linguistic analysis across languages with contrasting political attitudes.
Findings
GPT responses are more conservative in Polish and more liberal in Swedish.
Bias differences persist from GPT-3.5 to GPT-4 despite filtering policies.
Training data quality is crucial to reduce ideological bias.
Abstract
Extant work shows that generative AI models such as GPT-3.5 and 4 perpetuate social stereotypes and biases. One concerning but less explored source of bias is ideology. Do GPT models take ideological stances on politically sensitive topics? In this article, we provide an original approach to identifying ideological bias in generative models, showing that bias can stem from both the training data and the filtering algorithm. We leverage linguistic variation in countries with contrasting political attitudes to evaluate bias in average GPT responses to sensitive political topics in those languages. First, we find that GPT output is more conservative in languages that map well onto conservative societies (i.e., Polish), and more liberal in languages used uniquely in liberal societies (i.e., Swedish). This result provides strong evidence of training data bias in GPT models. Second,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Position-Wise Feed-Forward Layer · Residual Connection · Attention Dropout · Linear Layer · Discriminative Fine-Tuning
