Improving LLM Group Fairness on Tabular Data via In-Context Learning
Valeriia Cherepanova, Chia-Jung Lee, Nil-Jana Akpinar, Riccardo, Fogliato, Martin Andres Bertran, Michael Kearns, James Zou

TL;DR
This paper explores methods to improve group fairness in large language models' predictions on tabular data, addressing fairness issues that are not tackled by traditional debiasing techniques, and evaluates their effectiveness across multiple datasets.
Contribution
It systematically investigates four empirical approaches—prompt optimization, soft prompt tuning, example selection, and self-refinement—for enhancing fairness in LLM predictions on tabular data.
Findings
All methods improve demographic parity across datasets.
Strategies vary in effectiveness depending on dataset and model.
Maintains high overall prediction performance.
Abstract
Large language models (LLMs) have been shown to be effective on tabular prediction tasks in the low-data regime, leveraging their internal knowledge and ability to learn from instructions and examples. However, LLMs can fail to generate predictions that satisfy group fairness, that is, produce equitable outcomes across groups. Critically, conventional debiasing approaches for natural language tasks do not directly translate to mitigating group unfairness in tabular settings. In this work, we systematically investigate four empirical approaches to improve group fairness of LLM predictions on tabular datasets, including fair prompt optimization, soft prompt tuning, strategic selection of few-shot examples, and self-refining predictions via chain-of-thought reasoning. Through experiments on four tabular datasets using both open-source and proprietary LLMs, we show the effectiveness of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI
