Reducing Selection Bias in Large Language Models
J. E. Eicher, R. F. Irgoli\v{c}

TL;DR
This paper investigates inherent biases in large language models affecting list selection tasks, analyzing how factors like model type and prompt design influence bias magnitude and proposing methods to mitigate these biases.
Contribution
It provides a detailed analysis of bias sources in LLMs during list selection and introduces techniques to reduce bias by separating prompt steps.
Findings
Bias varies significantly across models and object types.
Primacy effect causes first list items to dominate outputs.
Separating guard rails from sampling reduces bias.
Abstract
Large Language Models (LLMs) like gpt-3.5-turbo-0613 and claude-instant-1.2 are vital in interpreting and executing semantic tasks. Unfortunately, these models' inherent biases adversely affect their performance Particularly affected is object selection from lists; a fundamental operation in digital navigation and decision-making. This research critically examines these biases and quantifies the effects on a representative list selection task. To explore these biases, we experiment manipulating temperature, list length, object identity, object type, prompt complexity, and model. We isolated and measured the influence of the biases on selection behavior. Our findings show that bias structure is strongly dependent on the model, with object type modulating the magnitude of the effect. With a strong primacy effect, causing the first objects in a list to be disproportionately represented in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Natural Language Processing Techniques
