From Plausible to Causal: Counterfactual Semantics for Policy Evaluation in Simulated Online Communities
Agam Goyal, Yian Wang, Eshwar Chandrasekharan, Hari Sundaram

TL;DR
This paper introduces a causal counterfactual framework for social simulations, enabling more accurate policy evaluation by distinguishing necessity and sufficiency of interventions.
Contribution
It formalizes causal semantics in social simulations, linking simulation design to explicit causal assumptions for policy-relevant insights.
Findings
Proposes adopting counterfactual causal semantics in social simulations.
Distinguishes between necessary and sufficient causation for policy analysis.
Highlights the importance of simulation fidelity for causal inference.
Abstract
LLM-based social simulations can generate believable community interactions, enabling ``policy wind tunnels'' where governance interventions are tested before deployment. But believability is not causality. Claims like ``intervention reduces escalation'' require causal semantics that current simulation work typically does not specify. We propose adopting the causal counterfactual framework, distinguishing \textit{necessary causation} (would the outcome have occurred without the intervention?) from \textit{sufficient causation} (does the intervention reliably produce the outcome?). This distinction maps onto different stakeholder needs: moderators diagnosing incidents require evidence about necessity, while platform designers choosing policies require evidence about sufficiency. We formalize this mapping, show how simulation design can support estimation under explicit assumptions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
