Leveraging In-Context Learning for Political Bias Testing of LLMs

Patrick Haller; Jannis Vamvas; Rico Sennrich; Lena A. J\"ager

arXiv:2506.22232·cs.CL·June 30, 2025

Leveraging In-Context Learning for Political Bias Testing of LLMs

Patrick Haller, Jannis Vamvas, Rico Sennrich, Lena A. J\"ager

PDF

Open Access 1 Video

TL;DR

This paper introduces Questionnaire Modeling (QM), a new method using human survey data as in-context examples to improve the stability and reliability of political bias testing in large language models (LLMs).

Contribution

It proposes QM as a novel probing task that enhances bias evaluation stability and enables comparison between instruction-tuned and base LLMs, revealing how tuning and size affect bias.

Findings

01

QM improves bias evaluation stability

02

Instruction tuning can alter bias direction

03

Larger models leverage in-context examples more effectively

Abstract

A growing body of work has been querying LLMs with political questions to evaluate their potential biases. However, this probing method has limited stability, making comparisons between models unreliable. In this paper, we argue that LLMs need more context. We propose a new probing task, Questionnaire Modeling (QM), that uses human survey data as in-context examples. We show that QM improves the stability of question-based bias evaluation, and demonstrate that it may be used to compare instruction-tuned models to their base versions. Experiments with LLMs of various sizes indicate that instruction tuning can indeed change the direction of bias. Furthermore, we observe a trend that larger models are able to leverage in-context examples more effectively, and generally exhibit smaller bias scores in QM. Data and code are publicly available.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Leveraging In-Context Learning for Political Bias Testing of LLMs· underline

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Natural Language Processing Techniques