A Detailed Factor Analysis for the Political Compass Test: Navigating Ideologies of Large Language Models
Sadia Kamal, Lalu Prasad Yadav Prakash, S M Rafiuddin, Mohammed Rakib, Atriya Sen, Sagnik Ray Choudhury

TL;DR
This paper investigates how prompt phrasing and fine-tuning influence political bias scores in large language models, revealing that these factors significantly affect results and questioning the validity of current bias measurement methods.
Contribution
It provides a detailed statistical analysis of factors affecting political bias assessments in LLMs, highlighting the impact of prompt phrasing and fine-tuning on test outcomes.
Findings
Prompt phrasing and fine-tuning significantly influence PCT scores.
Changes to standard generation parameters have minimal effect.
Models do not change responses to neutral prompts or texts.
Abstract
The Political Compass Test (PCT) and similar surveys are commonly used to assess political bias in auto-regressive LLMs. Our rigorous statistical experiments show that while changes to standard generation parameters have minimal effect on PCT scores, prompt phrasing and fine-tuning individually and together can significantly influence results. Interestingly, fine-tuning on politically rich vs. neutral datasets does not lead to different shifts in scores. We also generalize these findings to a similar popular test called 8 Values. Humans do not change their responses to questions when prompted differently (``answer this question'' vs ``state your opinion''), or after exposure to politically neutral text, such as mathematical formulae. But the fact that the models do so raises concerns about the validity of these tests for measuring model bias, and paves the way for deeper exploration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Media Influence and Politics · Populism, Right-Wing Movements
