Distribution Shift Alignment Helps LLMs Simulate Survey Response Distributions
Ji Huang, Mengfei Li, Shuai Shao

TL;DR
This paper introduces Distribution Shift Alignment (DSA), a two-stage fine-tuning method for large language models that improves survey response simulation accuracy by aligning output distributions and distribution shifts, reducing data needs.
Contribution
The paper proposes DSA, a novel fine-tuning approach that better captures distribution changes, outperforming existing methods in survey response simulation tasks.
Findings
DSA outperforms other methods on five public survey datasets.
DSA reduces real data requirements by over 50%.
Empirical results show improved accuracy and robustness.
Abstract
Large language models (LLMs) offer a promising way to simulate human survey responses, potentially reducing the cost of large-scale data collection. However, existing zero-shot methods suffer from prompt sensitivity and low accuracy, while conventional fine-tuning approaches mostly fit the training set distributions and struggle to produce results more accurate than the training set itself, which deviates from the original goal of using LLMs to simulate survey responses. Building on this observation, we introduce Distribution Shift Alignment (DSA), a two-stage fine-tuning method that aligns both the output distributions and the distribution shifts across different backgrounds. By learning how these distributions change rather than fitting training data, DSA can provide results substantially closer to the true distribution than the training data. Empirically, DSA consistently outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
