Stylometric Detection of AI-Generated Text in Twitter Timelines
Tharindu Kumarage, Joshua Garland, Amrita Bhattacharjee, Kirill, Trapeznikov, Scott Ruston, Huan Liu

TL;DR
This paper introduces a stylometric-based algorithm to detect AI-generated tweets and identify when AI starts generating content within Twitter timelines, addressing challenges posed by short text lengths.
Contribution
It proposes novel stylometric models for distinguishing human and AI tweets and detecting the onset of AI-generated content in timelines, improving detection accuracy.
Findings
Stylometric features effectively augment existing detectors.
Models successfully discriminate between human and AI tweets.
Detection of AI-generated content onset is feasible in Twitter timelines.
Abstract
Recent advancements in pre-trained language models have enabled convenient methods for generating human-like text at a large scale. Though these generation capabilities hold great potential for breakthrough applications, it can also be a tool for an adversary to generate misinformation. In particular, social media platforms like Twitter are highly susceptible to AI-generated misinformation. A potential threat scenario is when an adversary hijacks a credible user account and incorporates a natural language generator to generate misinformation. Such threats necessitate automated detectors for AI-generated tweets in a given user's Twitter timeline. However, tweets are inherently short, thus making it difficult for current state-of-the-art pre-trained language model-based detectors to accurately detect at what point the AI starts to generate tweets in a given Twitter timeline. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Misinformation and Its Impacts · Topic Modeling
