Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach
Changgeon Ko, Jisu Shin, Hoyun Song, Jeongyeon Seo, Jong C. Park

TL;DR
This paper introduces a fact-based bias assessment metric for LLMs, using real-world data and human perception to evaluate bias, revealing that bias varies with different criteria and emphasizing the importance of multi-perspective evaluation.
Contribution
The paper proposes a novel fact-based bias metric for LLMs, incorporating real-world statistics and human perception, offering an objective alternative to equality-based bias assessments.
Findings
Humans prefer LLM outputs aligned with real-world demographics.
Bias in LLMs varies depending on the evaluation criteria.
Multi-perspective assessment reveals different bias levels.
Abstract
Large language models (LLMs) often reflect real-world biases, leading to efforts to mitigate these effects and make the models unbiased. Achieving this goal requires defining clear criteria for an unbiased state, with any deviation from these criteria considered biased. Some studies define an unbiased state as equal treatment across diverse demographic groups, aiming for balanced outputs from LLMs. However, differing perspectives on equality and the importance of pluralism make it challenging to establish a universal standard. Alternatively, other approaches propose using fact-based criteria for more consistent and objective evaluations, though these methods have not yet been fully applied to LLM bias assessments. Thus, there is a need for a metric with objective criteria that offers a distinct perspective from equality-based approaches. Motivated by this need, we introduce a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Imbalanced Data Classification Techniques
MethodsALIGN
