Polypersona: Persona-Grounded LLM for Synthetic Survey Responses
Tejaswani Dash, Dinesh Karri, Anudeep Vurity, Gautam Datla, Tazeem Ahmad, Saima Rafi, Rohith Tangudu

TL;DR
This paper presents PolyPersona, a resource-efficient framework for generating persona-conditioned survey responses with small language models, enabling scalable, coherent, and bias-analyzable synthetic survey data across multiple domains.
Contribution
It introduces a novel, resource-efficient method for synthesizing multi-domain survey responses conditioned on personas using compact models and a dialogue-based data pipeline.
Findings
Small models like TinyLlama 1.1B achieve comparable performance to larger models.
The framework produces coherent, persona-aligned survey responses.
Synthetic data supports scalable evaluation and bias analysis.
Abstract
This paper introduces PolyPersona, a generative framework for synthesizing persona-conditioned survey responses across multiple domains. The framework instruction-tunes compact chat models using parameter-efficient LoRA adapters with 4-bit quantization under a resource-adaptive training setup. A dialogue-based data pipeline explicitly preserves persona cues, ensuring consistent behavioral alignment across generated responses. Using this pipeline, we construct a dataset of 3,568 synthetic survey responses spanning ten domains and 433 distinct personas, enabling controlled instruction tuning and systematic multi-domain evaluation. We evaluate the generated responses using a multi-metric evaluation suite that combines standard text generation metrics, including BLEU, ROUGE, and BERTScore, with survey-specific metrics designed to assess structural coherence, stylistic consistency, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersona Design and Applications · Innovative Human-Technology Interaction · Recommender Systems and Techniques
