Confronting LLMs with Traditional ML: Rethinking the Fairness of Large   Language Models in Tabular Classifications

Yanchen Liu; Srishti Gautam; Jiaqi Ma; Himabindu Lakkaraju

arXiv:2310.14607·cs.CL·April 4, 2024·2 cites

Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications

Yanchen Liu, Srishti Gautam, Jiaqi Ma, Himabindu Lakkaraju

PDF

Open Access 1 Video

TL;DR

This paper investigates how large language models (LLMs) inherit social biases from their training data, affecting fairness in tabular classification tasks, and compares their bias mitigation effectiveness to traditional models.

Contribution

It reveals that LLMs inherently carry social biases from pretraining data and that bias mitigation techniques have limited success compared to traditional models.

Findings

01

LLMs inherit social biases from training data.

02

Bias mitigation methods only moderately reduce biases.

03

Bias gap remains larger in LLMs than in traditional models.

Abstract

Recent literature has suggested the potential of using large language models (LLMs) to make classifications for tabular tasks. However, LLMs have been shown to exhibit harmful social biases that reflect the stereotypes and inequalities present in society. To this end, as well as the widespread use of tabular data in many high-stake applications, it is important to explore the following questions: what sources of information do LLMs draw upon when making classifications for tabular tasks; whether and to what extent are LLM classifications for tabular data influenced by social biases and stereotypes; and what are the consequential implications for fairness? Through a series of experiments, we delve into these questions and show that LLMs tend to inherit social biases from their training data which significantly impact their fairness in tabular classification tasks. Furthermore, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications· underline

Taxonomy

TopicsText Readability and Simplification