The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events
Gunjan, Sidahmed Benabderrahmane, Talal Rahwan

TL;DR
This study compares synthetic and observed political discourse across nine crisis events, revealing that AI-generated texts are fluent but less realistic at the population level, especially in fast-moving crises.
Contribution
It introduces a novel population-level auditing framework for evaluating the social realism of LLM-generated political discourse across diverse crisis contexts.
Findings
Synthetic discourse is more negative and less dispersed in sentiment.
Synthetic texts are structurally more regular and lexically more abstract.
Differences are larger in fast-moving, decentralized crises.
Abstract
Large Language Models (LLMs) can generate fluent political text at scale, raising concerns about synthetic discourse during crises and social conflict. Existing AI-text detection often focuses on sentence-level cues such as perplexity, burstiness, or token irregularities, but these signals may weaken as generative systems improve. We instead adopt a Computational Social Science perspective and ask whether synthetic political discourse behaves like an observed online population. We construct a paired corpus of 1,789,406 posts across nine crisis events: COVID-19, the Jan. 6 Capitol attack, the 2020 and 2024 U.S. elections, Dobbs/Roe v. Wade, the 2020 BLM protests, U.S. midterms, the Utah shooting, and the U.S.-Iran war. For each event, we compare observed discourse from social platforms with synthetic discourse generated for the same context. We evaluate four dimensions: emotional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
