Towards Order Fairness: Mitigating LLMs Order Sensitivity through Dual Group Advantage Optimization

Xu Chu; Guanyu Wang; Zhijie Tan; Xinrong Chen; Ziyu Li; Tong Mo; Weiping Li

arXiv:2605.11974·cs.LG·May 13, 2026

Towards Order Fairness: Mitigating LLMs Order Sensitivity through Dual Group Advantage Optimization

Xu Chu, Guanyu Wang, Zhijie Tan, Xinrong Chen, Ziyu Li, Tong Mo, Weiping Li

PDF

1 Repo

TL;DR

This paper introduces DGAO, a reinforcement learning-based method to reduce order bias in LLMs, improving fairness and performance across various tasks.

Contribution

DGAO is the first approach using reinforcement learning to simultaneously enhance order stability and accuracy in LLMs.

Findings

01

DGAO achieves superior order fairness compared to previous methods.

02

DGAO improves performance on RAG, mathematical reasoning, and classification tasks.

03

New metrics, Consistency Rate and Overconfidence Rate, effectively evaluate order stability.

Abstract

Large Language Models (LLMs) suffer from order bias, where their performance is affected by the arrangement order of input elements. This unfairness limits the model's applications in scenarios such as in-context learning and Retrieval-Augmented Generation (RAG). Recent studies attempt to obtain optimal or suboptimal arrangements based on statistical results or using dataset-based search, but these methods increase inference overhead while leaving the model's inherent order bias unresolved. Other studies mitigate order sensitivity through supervised fine-tuning using augmented training sets with multiple order variants, but often at the cost of accuracy, trapping the model in consistent yet incorrect hallucinations. In this paper, we propose \textbf{D}ual \textbf{G}roup \textbf{A}dvantage \textbf{O}ptimization (\textbf{DGAO}), which aims to improve model accuracy and order stability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Hyalinesky/DGAO
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.