Identifying the sources of ideological bias in GPT models through linguistic variation in output

Christina Walker; Joan C. Timoneda

arXiv:2409.06043 · cs.CL · September 11, 2024 · 3 cites

Open Access

TL;DR

This paper investigates ideological bias in GPT models by comparing their output across languages spoken in societies with contrasting political attitudes. It finds bias linked to both the training data and the filtering policies, and emphasizes the need for curated datasets.

Contribution

The paper introduces a novel method for detecting ideological bias in GPT models: comparing their output across languages spoken in societies with contrasting political attitudes.

Findings

01

GPT responses are more conservative in Polish and more liberal in Swedish.

02

Bias differences persist from GPT-3.5 to GPT-4 despite filtering policies.

03

Training data quality is crucial to reduce ideological bias.

Abstract

Extant work shows that generative AI models such as GPT-3.5 and 4 perpetuate social stereotypes and biases. One concerning but less explored source of bias is ideology. Do GPT models take ideological stances on politically sensitive topics? In this article, we provide an original approach to identifying ideological bias in generative models, showing that bias can stem from both the training data and the filtering algorithm. We leverage linguistic variation in countries with contrasting political attitudes to evaluate bias in average GPT responses to sensitive political topics in those languages. First, we find that GPT output is more conservative in languages that map well onto conservative societies (i.e., Polish), and more liberal in languages used uniquely in liberal societies (i.e., Swedish). This result provides strong evidence of training data bias in GPT models. Second,…

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

Methods: Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Position-Wise Feed-Forward Layer · Residual Connection · Attention Dropout · Linear Layer · Discriminative Fine-Tuning