Fairness-Aware Retrieval Optimization for Retrieval-Augmented Generation

Yingqi Zhao; Vasilis Efthymiou; Jyrki Nummenmaa; Kostas Stefanidis

arXiv:2605.15790·cs.DB·May 18, 2026

Fairness-Aware Retrieval Optimization for Retrieval-Augmented Generation

Yingqi Zhao, Vasilis Efthymiou, Jyrki Nummenmaa, Kostas Stefanidis

PDF

TL;DR

This paper introduces a fairness-aware retrieval framework for Retrieval-Augmented Generation systems that mitigates bias propagation while maintaining relevance, using a novel optimization approach called FARO.

Contribution

It presents a new framework combining bias control, position-aware modeling, and an efficient optimization method for fairness in RAG retrievals.

Findings

01

Effectively reduces bias in generated outputs.

02

Preserves relevance while improving fairness.

03

Scalable optimization method outperforms baselines.

Abstract

Retrieval-Augmented Generation (RAG) improves reliability of large language models by incorporating external knowledge, but the retrieval process can introduce bias that propagates to generated outputs. This issue is particularly challenging in top-k settings, where multiple documents jointly influence generation. We propose a fairness-aware retrieval framework that models and controls this bias. Our approach combines controlled bias injection via reranking, a position-aware model of bias propagation, and an optimization formulation that balances relevance and fairness. We further introduce a scalable solution based on Quadratic Fairness via Dual Hyperplane Approximation (FARO), which enables efficient optimization through problem decomposition. Experimental results show that our method effectively mitigates generation bias while preserving relevance. This work provides a principled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.