Do Large Language Models Plan Answer Positions? Position Bias in Multiple-Choice Question Generation
Xuemei Tang, Xufeng Duan, Zhenguang G. Cai

TL;DR
This paper investigates how large language models exhibit systematic position biases when generating multiple-choice questions, revealing implicit planning and demonstrating control over answer positioning.
Contribution
It uncovers structured position biases in LLM-generated MCQs, links these biases to internal representations, and introduces activation steering to influence answer placement.
Findings
LLMs show consistent position biases across different models and tasks.
Hidden representations encode signals predicting answer positions.
Activation steering can manipulate and partially control answer placement.
Abstract
Large language models (LLMs) are increasingly used to generate multiple-choice questions (MCQs), where correct answers should ideally be uniformly distributed across options. However, we observe that LLMs exhibit systematic position biases during generation. Through extensive experiments with 10 LLMs and 5 vision-language models (VLMs) on three MCQ generation tasks, we show that these biases are structured, with similar patterns emerging within model families. To investigate the underlying mechanisms, we conduct probing experiments and find that hidden representations in the question stem encode predictive signals of the correct answer position, suggesting that answer position may be implicitly planned during generation. Building on this insight, we apply activation steering to manipulate internal representations and influence answer position. Our results show that steering can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
