LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
Do Xuan Long, Hai Nguyen Ngoc, Tiviatis Sim, Hieu Dao, Shafiq Joty,, Kenji Kawaguchi, Nancy F. Chen, Min-Yen Kan

TL;DR
This paper systematically evaluates format bias in large language models, introduces metrics to measure it, and proposes mitigation strategies that significantly reduce performance variance across different output formats.
Contribution
It is the first comprehensive study to quantify and address format bias in LLMs, offering new evaluation metrics and effective mitigation techniques.
Findings
Significant format bias exists across state-of-the-art LLMs.
Improving format-instruction following reduces format bias.
Mitigation techniques can drastically decrease performance variance.
Abstract
We present the first systematic evaluation examining format bias in performance of large language models (LLMs). Our approach distinguishes between two categories of an evaluation metric under format constraints to reliably and accurately assess performance: one measures performance when format constraints are adhered to, while the other evaluates performance regardless of constraint adherence. We then define a metric for measuring the format bias of LLMs and establish effective strategies to reduce it. Subsequently, we present our empirical format bias evaluation spanning four commonly used categories -- multiple-choice question-answer, wrapping, list, and mapping -- covering 15 widely-used formats. Our evaluation on eight generation tasks uncovers significant format bias across state-of-the-art LLMs. We further discover that improving the format-instruction following capabilities of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsResearch Data Management Practices · scientometrics and bibliometrics research · Academic Publishing and Open Access
