Wisdom from Diversity: Bias Mitigation Through Hybrid Human-LLM Crowds

Axel Abels; Tom Lenaerts

arXiv:2505.12349·cs.CL·May 20, 2025

Wisdom from Diversity: Bias Mitigation Through Hybrid Human-LLM Crowds

Axel Abels, Tom Lenaerts

PDF

Open Access

TL;DR

This paper investigates bias mitigation in large language models by analyzing responses, demonstrating that hybrid human-LLM crowds with weighted aggregation strategies can effectively reduce biases and improve accuracy.

Contribution

It introduces hybrid human-LLM crowds and locally weighted aggregation methods as novel strategies for bias mitigation and performance enhancement in LLMs.

Findings

01

Averaging LLM responses can increase bias due to limited diversity.

02

Locally weighted aggregation reduces bias and improves accuracy.

03

Hybrid crowds of humans and LLMs further decrease biases and enhance performance.

Abstract

Despite their performance, large language models (LLMs) can inadvertently perpetuate biases found in the data they are trained on. By analyzing LLM responses to bias-eliciting headlines, we find that these models often mirror human biases. To address this, we explore crowd-based strategies for mitigating bias through response aggregation. We first demonstrate that simply averaging responses from multiple LLMs, intended to leverage the "wisdom of the crowd", can exacerbate existing biases due to the limited diversity within LLM crowds. In contrast, we show that locally weighted aggregation methods more effectively leverage the wisdom of the LLM crowd, achieving both bias mitigation and improved accuracy. Finally, recognizing the complementary strengths of LLMs (accuracy) and humans (diversity), we demonstrate that hybrid crowds containing both significantly enhance performance and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Mobile Crowdsensing and Crowdsourcing · Artificial Intelligence in Healthcare and Education