CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs
Ravi Ranjan, Utkarsh Grover, Mayur Akewar, Xiaomin Lin, Agoritsa Polyzou

TL;DR
CatRAG introduces a novel structure-preserving debiasing framework for large language models that combines functor-guided embedding adjustments with retrieval-augmented generation, significantly reducing biases and improving fairness.
Contribution
The paper presents a dual-pronged debiasing method integrating category-theoretic functors with retrieval-augmented generation, achieving state-of-the-art fairness results in LLMs.
Findings
Achieves up to 40% accuracy improvement over base models.
Reduces bias scores to near zero across multiple demographic subgroups.
Outperforms prior debiasing methods by more than 10%.
Abstract
Large Language Models (LLMs) are deployed in high-stakes settings but can show demographic, gender, and geographic biases that undermine fairness and trust. Prior debiasing methods, including embedding-space projections, prompt-based steering, and causal interventions, often act at a single stage of the pipeline, resulting in incomplete mitigation and brittle utility trade-offs under distribution shifts. We propose CatRAG Debiasing, a dual-pronged framework that integrates functor with Retrieval-Augmented Generation (RAG) guided structural debiasing. The functor component leverages category-theoretic structure to induce a principled, structure-preserving projection that suppresses bias-associated directions in the embedding space while retaining task-relevant semantics. On the Bias Benchmark for Question Answering (BBQ) across three open-source LLMs (Meta Llama-3, OpenAI GPT-OSS, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Topic Modeling · Artificial Intelligence in Healthcare and Education
