Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression

Jadie Adams; Brian Hu; Emily Veenhuis; David Joy; Bharadwaj Ravichandran; Aaron Bray; Anthony Hoogs; Arslan Basharat

arXiv:2508.08509·cs.CL·August 13, 2025

Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression

Jadie Adams, Brian Hu, Emily Veenhuis, David Joy, Bharadwaj Ravichandran, Aaron Bray, Anthony Hoogs, Arslan Basharat

PDF

Open Access

TL;DR

This paper introduces a steerable pluralistic alignment method for large language models that captures diverse user preferences across multiple attributes using few-shot comparative regression, outperforming existing techniques.

Contribution

The paper proposes a novel few-shot comparative regression approach for pluralistic alignment, enabling LLMs to adapt to individual preferences across multiple attributes with interpretability.

Findings

01

Outperforms baseline and state-of-the-art methods

02

Demonstrates applicability to value-aligned decision-making

03

Provides new benchmarks for pluralistic alignment

Abstract

Large language models (LLMs) are currently aligned using techniques such as reinforcement learning from human feedback (RLHF). However, these methods use scalar rewards that can only reflect user preferences on average. Pluralistic alignment instead seeks to capture diverse user preferences across a set of attributes, moving beyond just helpfulness and harmlessness. Toward this end, we propose a steerable pluralistic model based on few-shot comparative regression that can adapt to individual user preferences. Our approach leverages in-context learning and reasoning, grounded in a set of fine-grained attributes, to compare response options and make aligned choices. To evaluate our algorithm, we also propose two new steerable pluralistic benchmarks by adapting the Moral Integrity Corpus (MIC) and the HelpSteer2 datasets, demonstrating the applicability of our approach to value-aligned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Mobile Crowdsensing and Crowdsourcing