Polarization by Default: Auditing Recommendation Bias in LLM-Based Content Curation
Nicol\`o Pagan, Christopher Barrie, Chris Andrew Bail, Petter T\"ornberg

TL;DR
This study systematically examines biases in LLM-based content curation across major providers and social media platforms, revealing polarization, sentiment, and demographic biases that vary with prompt strategies.
Contribution
It provides a comprehensive simulation analysis of recommendation biases in LLMs, highlighting how biases differ across providers, prompts, and social media platforms.
Findings
Biases vary significantly across providers and prompts.
Polarization is amplified in all configurations.
Political bias favors left-leaning authors on Twitter/X.
Abstract
Large Language Models (LLMs) are increasingly deployed to curate and rank human-created content, yet the nature and structure of their biases in these tasks remains poorly understood: which biases are robust across providers and platforms, and which can be mitigated through prompt design. We present a controlled simulation study mapping content selection biases across three major LLM providers (OpenAI, Anthropic, Google) on real social media datasets from Twitter/X, Bluesky, and Reddit, using six prompting strategies (\textit{general}, \textit{popular}, \textit{engaging}, \textit{informative}, \textit{controversial}, \textit{neutral}). Through 540,000 simulated top-10 selections from pools of 100 posts across 54 experimental conditions, we find that biases differ substantially in how structural and how prompt-sensitive they are. Polarization is amplified across all configurations,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
