LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users

Elinor Poole-Dayan; Deb Roy; Jad Kabbara

arXiv:2406.17737·cs.CL·November 7, 2025·1 cites

LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users

Elinor Poole-Dayan, Deb Roy, Jad Kabbara

PDF

Open Access 1 Video

TL;DR

This paper investigates how the performance of large language models varies across different user traits, revealing that vulnerable users such as non-native English speakers and those with lower education are disproportionately affected by inaccuracies and undesirable behaviors.

Contribution

The study provides extensive empirical analysis showing that LLMs underperform for vulnerable user groups based on language proficiency, education, and country of origin.

Findings

01

Undesirable behaviors occur more for users with lower English proficiency.

02

Lower-education users experience more inaccuracies from LLMs.

03

Users outside the US face higher rates of model unreliability.

Abstract

While state-of-the-art large language models (LLMs) have shown impressive performance on many tasks, there has been extensive research on undesirable model behavior such as hallucinations and bias. In this work, we investigate how the quality of LLM responses changes in terms of information accuracy, truthfulness, and refusals depending on three user traits: English proficiency, education level, and country of origin. We present extensive experimentation on three state-of-the-art LLMs and two different datasets targeting truthfulness and factuality. Our findings suggest that undesirable behaviors in state-of-the-art LLMs occur disproportionately more for users with lower English proficiency, of lower education status, and originating from outside the US, rendering these models unreliable sources of information towards their most vulnerable users.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users· underline

Taxonomy

TopicsNetwork Security and Intrusion Detection