In Silico Development of Psychometric Scales: Feasibility of Representative Population Data Simulation with LLMs
Enrico Cipriani, Pavel Okopnyi, Danilo Menicucci, Simone Grassini

TL;DR
This study explores using Large Language Models to generate synthetic data for psychometric scale development, showing they can replicate group-level structures but not individual data properties, aiding early-stage testing.
Contribution
The paper demonstrates the feasibility of using LLMs to simulate psychometric data, enabling preliminary testing of scale structures before collecting real data.
Findings
Simulated data replicated factor structures in 3 of 4 studies
Group-level invariance was achieved in simulated datasets
Individual-level data properties were not accurately captured
Abstract
Developing and validating psychometric scales requires large samples, multiple testing phases, and substantial resources. Recent advances in Large Language Models (LLMs) enable the generation of synthetic participant data by prompting models to answer items while impersonating individuals of specific demographic profiles, potentially allowing in silico piloting before real data collection. Across four preregistered studies (N = circa 300 each), we tested whether LLM-simulated datasets can reproduce the latent structures and measurement properties of human responses. In Studies 1-2, we compared LLM-generated data with real datasets for two validated scales; in Studies 3-4, we created new scales using EFA on simulated data and then examined whether these structures generalized to newly collected human samples. Simulated datasets replicated the intended factor structures in three of four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Psychometric Methodologies and Testing · Mental Health Research Topics
