Your Reviews Replicate You: LLM-Based Agents as Customer Digital Twins for Conjoint Analysis
Bin Xuan, Jungmin Hwang, Hakyeon Lee

TL;DR
This paper introduces a novel LLM-based customer digital twin framework for conjoint analysis, enabling scalable, cost-effective preference estimation through virtual respondents with high predictive accuracy.
Contribution
It presents a new method combining retrieval-augmented generation and prompt engineering to create virtual customer agents that accurately predict real user preferences.
Findings
Customer digital twins predict user preferences with 87.73% accuracy.
The framework effectively quantifies attribute trade-offs in product categories.
The approach offers a scalable, cost-efficient alternative to traditional conjoint analysis.
Abstract
Conjoint analysis is a cornerstone of market research for estimating consumer preferences; however, traditional methods face persistent challenges regarding time, cost, and respondent fatigue. To address these limitations, this study proposes a framework that utilizes large language model (LLM)-based "customer digital twins (CDT)" as virtual respondents. We identified active users within the Reddit community and aggregated their comprehensive review histories to construct individualized vector databases. By integrating retrieval-augmented generation (RAG) with prompt engineering, this study developed customer agents capable of dynamically retrieving and reasoning upon their specific past preferences and constraints. These customer agents, called CDTs, performed pairwise comparison tasks on product profiles generated via fractional factorial design, and the resulting choice data was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
