FairReason: Balancing Reasoning and Social Bias in MLLMs

Zhenyu Pan; Yutong Zhang; Jianshu Zhang; Haoran Lu; Haozheng Luo; Yuwei Han; Philip S. Yu; Manling Li; Han Liu

arXiv:2507.23067·cs.AI·September 9, 2025

FairReason: Balancing Reasoning and Social Bias in MLLMs

Zhenyu Pan, Yutong Zhang, Jianshu Zhang, Haoran Lu, Haozheng Luo, Yuwei Han, Philip S. Yu, Manling Li, Han Liu

PDF

Open Access

TL;DR

This paper investigates how to balance reasoning capabilities and social bias mitigation in Multimodal Large Language Models by benchmarking strategies and exploring trade-offs, providing practical guidance for fair and effective models.

Contribution

It systematically compares bias-mitigation methods and analyzes the reasoning-bias trade-off, offering a data-driven approach to optimize both fairness and reasoning in MLLMs.

Findings

01

Reinforcement learning with a 1:4 mix reduces stereotypes by 10%.

02

The same mix retains 88% of reasoning accuracy.

03

Benchmarking reveals strengths and weaknesses of different bias mitigation strategies.

Abstract

Multimodal Large Language Models (MLLMs) already achieve state-of-the-art results across a wide range of tasks and modalities. To push their reasoning ability further, recent studies explore advanced prompting schemes and post-training fine-tuning. Although these techniques improve logical accuracy, they frequently leave the models' outputs burdened with pronounced social biases. Clarifying how reasoning gains interact with bias mitigation-and whether the two objectives inherently trade off-therefore remains an open and pressing research problem. Our study begins by benchmarking three bias-mitigation strategies-supervised fine-uning (SFT), knowledge distillation (KD), and rule-based reinforcement learning (RL)-under identical conditions, establishing their baseline strengths and weaknesses. Building on these results, we vary the proportion of debias-focused and reasoning-centric samples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation