MindVote: When AI Meets the Wild West of Social Media Opinion
Xutao Mao, Ezra Xuanru Tao, Leyao Wang

TL;DR
MindVote introduces a new benchmark based on real social media data to evaluate LLMs' ability to predict public opinion, addressing limitations of traditional survey-based assessments.
Contribution
This paper presents MindVote, the first benchmark grounded in authentic social media discourse for evaluating LLMs' public opinion prediction capabilities.
Findings
15 LLMs evaluated on MindVote benchmark
Benchmark includes 3,918 naturalistic polls from Reddit and Weibo
Provides a more ecologically valid assessment of LLMs' social intelligence
Abstract
Large Language Models (LLMs) are increasingly used as scalable tools for pilot testing, predicting public opinion distributions before deploying costly surveys. To serve as effective pilot testing tools, the performance of these LLMs is typically benchmarked against their ability to reproduce the outcomes of past structured surveys. This evaluation paradigm, however, is misaligned with the dynamic, context-rich social media environments where public opinion is increasingly formed and expressed. By design, surveys strip away the social, cultural, and temporal context that shapes public opinion, and LLM benchmarks built on this paradigm inherit these critical limitations. To bridge this gap, we introduce MindVote, the first benchmark for public opinion distribution prediction grounded in authentic social media discourse. MindVote is constructed from 3,918 naturalistic polls sourced from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEthics and Social Impacts of AI · Misinformation and Its Impacts
