Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction

Nils Schwager; Simon M\"unker; Alistair Plum; Achim Rettinger

arXiv:2602.22752·cs.CL·March 27, 2026

Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction

Nils Schwager, Simon M\"unker, Alistair Plum, Achim Rettinger

PDF

Open Access 2 Models 1 Video

TL;DR

This paper introduces Conditioned Comment Prediction to evaluate LLMs' ability to simulate social media user behavior, revealing insights on prompting strategies, fine-tuning effects, and operational validity in multilingual low-resource scenarios.

Contribution

It proposes a new evaluation framework for LLMs as social media user simulators and analyzes the effects of prompting and fine-tuning on their behavioral fidelity.

Findings

01

Fine-tuning aligns text structure but reduces semantic grounding.

02

Explicit biographies become unnecessary after fine-tuning.

03

Operational validity depends on using authentic behavioral traces.

Abstract

The transition of Large Language Models (LLMs) from exploratory tools to active "silicon subjects" in social science lacks extensive validation of operational validity. This study introduces Conditioned Comment Prediction (CCP), a task in which a model predicts how a user would comment on a given stimulus by comparing generated outputs with authentic digital traces. This framework enables a rigorous evaluation of current LLM capabilities with respect to the simulation of social media user behavior. We evaluated open-weight 8B models (Llama3.1, Qwen3, Ministral) in English, German, and Luxembourgish language scenarios. By systematically comparing prompting strategies (explicit vs. implicit) and the impact of Supervised Fine-Tuning (SFT), we identify a critical form vs. content decoupling in low-resource settings: while SFT aligns the surface structure of the text output (length and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction· underline

Taxonomy

TopicsPersona Design and Applications · Topic Modeling · Mental Health via Writing