Linguistic Uncertainty and Engagement in Arabic-Language X (formerly Twitter) Discourse
Mohamed Soufan

TL;DR
This study investigates how linguistic uncertainty in Arabic-language tweets about Lebanon influences user engagement, revealing that uncertain tweets tend to receive significantly higher and more conversational interactions.
Contribution
It introduces a lexicon-based classifier for identifying uncertainty in Arabic social media and demonstrates its positive correlation with increased user engagement.
Findings
Uncertain tweets have 51.5% higher mean engagement.
Uncertainty correlates with approximately 25% higher expected engagement.
Engagement increases are strongest for replies, then retweets and likes.
Abstract
Linguistic uncertainty is a common feature of social media discourse, yet its relationship with user engagement remains underexplored, particularly in non-English contexts. Using a dataset of 16,695 Arabic-language tweets about Lebanon posted over a 35-day period, we examine whether tweets expressing linguistic uncertainty receive different levels and forms of engagement compared to certainty-marked tweets. We develop a lexicon-based, context-sensitive classifier to identify uncertainty markers and classify 29.9% of tweets as uncertain. Descriptive analyses indicate that uncertain tweets exhibit 51.5% higher mean total engagement (likes, retweets, and replies). Regression models controlling for tweet length, URL presence, and account verification status confirm a positive association between uncertainty and engagement (\b{eta} = 0.221, SE = 0.044, p < 0.001), corresponding to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Digital Communication and Language · Impact of Technology on Adolescents
