Limits to Predicting Online Speech Using Large Language Models

Mina Remeli; Moritz Hardt; Robert C. Williamson

arXiv:2407.12850·cs.CL·January 7, 2026·1 cites

Limits to Predicting Online Speech Using Large Language Models

Mina Remeli, Moritz Hardt, Robert C. Williamson

PDF

Open Access 1 Repo

TL;DR

This study investigates the limits of predicting online speech with large language models, revealing that user-specific context greatly enhances prediction accuracy and that certain social features are learned in-context, with consistent results across diverse demographics.

Contribution

The paper provides a comprehensive analysis of predictability limits in modeling user-generated content on Twitter, highlighting the importance of user history and social features in language model performance.

Findings

01

Predicting individual user posts remains challenging.

02

Using user history significantly improves prediction accuracy.

03

Models learn social features like @-mentions and hashtags in-context.

Abstract

Our paper studies the predictability of online speech -- that is, how well language models learn to model the distribution of user generated content on X (previously Twitter). We define predictability as a measure of the model's uncertainty, i.e. its negative log-likelihood. As the basis of our study, we collect 10M tweets for ``tweet-tuning'' base models and a further 6.25M posts from more than five thousand X (previously Twitter) users and their peers. In our study involving more than 5000 subjects, we find that predicting posts of individual users remains surprisingly hard. Moreover, it matters greatly what context is used: models using the users' own history significantly outperform models using posts from their social circle. We validate these results across four large language models ranging in size from 1.5 billion to 70 billion parameters. Moreover, our results replicate if…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

socialfoundations/twitter-predictability
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis

MethodsBalanced Selection