Improving the Distributional Alignment of LLMs using Supervision

Gauri Kambhatla; Sanjana Gautam; Angela Zhang; Alex Liu; Ravi Srinivasan; Junyi Jessy Li; Matthew Lease

arXiv:2507.00439·cs.CL·April 22, 2026

Improving the Distributional Alignment of LLMs using Supervision

Gauri Kambhatla, Sanjana Gautam, Angela Zhang, Alex Liu, Ravi Srinivasan, Junyi Jessy Li, Matthew Lease

PDF

TL;DR

This paper demonstrates that simple supervision techniques can enhance the alignment of large language models with diverse population groups across various subjective questions and datasets.

Contribution

It introduces a supervision method that improves distributional alignment of LLMs and provides a benchmark for future research across multiple datasets and models.

Findings

01

Supervision improves LLM alignment with diverse groups.

02

Alignment varies across specific population groups.

03

Benchmarking over many LLMs and prompts offers insights for future work.

Abstract

The ability to accurately align LLMs with diverse population groups on subjective questions would have great value. In this work, we show that adding simple supervision can more consistently improve the alignment of LLM-generated distributions with diverse population groups, as measured across three datasets spanning public health, public opinion, and values and beliefs. Beyond evaluating average alignment, we also report how alignment varies across specific groups. Our broad findings provide insights into the distributional alignment of LLM generations with diverse populations. By conducting evaluation over many LLMs and prompting strategies, we provide a benchmark to stimulate future research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.