SHARP: Social Harm Analysis via Risk Profiles for Measuring Inequities in Large Language Models
Alok Abhishek, Tushar Bandopadhyay, Lisa Erickson

TL;DR
This paper introduces SHARP, a multidimensional framework for evaluating social harm in large language models, emphasizing worst-case risks and distributional structure to improve understanding of model failures.
Contribution
SHARP models social harm as a multivariate distribution, integrating bias, fairness, ethics, and reliability, and employs tail-sensitive metrics like CVaR95 for comprehensive risk assessment.
Findings
Models with similar average risk differ significantly in tail exposure.
Bias shows the strongest tail severity among harm dimensions.
Heterogeneous failure structures are revealed that scalar benchmarks overlook.
Abstract
Large language models (LLMs) are increasingly deployed in high-stakes domains, where rare but severe failures can result in irreversible harm. However, prevailing evaluation benchmarks often reduce complex social risk to mean-centered scalar scores, thereby obscuring distributional structure, cross-dimensional interactions, and worst-case behavior. This paper introduces Social Harm Analysis via Risk Profiles (SHARP), a framework for multidimensional, distribution-aware evaluation of social harm. SHARP models harm as a multivariate random variable and integrates explicit decomposition into bias, fairness, ethics, and epistemic reliability with a union-of-failures aggregation reparameterized as additive cumulative log-risk. The framework further employs risk-sensitive distributional statistics, with Conditional Value at Risk (CVaR95) as a primary metric, to characterize worst-case model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Computational and Text Analysis Methods · Explainable Artificial Intelligence (XAI)
