Authorship Impersonation via LLM Prompting does not Evade Authorship Verification Methods
Baoyi Zeng, Andrea Nini

TL;DR
This study tests whether large language models can generate impersonation texts that evade existing authorship verification systems, finding current methods remain effective due to the lexical diversity of LLM outputs.
Contribution
It demonstrates that LLM-generated impersonations do not easily fool current forensic authorship verification methods across various genres.
Findings
LLM-generated texts do not bypass AV systems effectively.
Some AV methods perform better at rejecting impersonations than genuine negatives.
Lexical diversity and entropy in LLM texts contribute to AV robustness.
Abstract
Authorship verification (AV), the task of determining whether a questioned text was written by a specific individual, is a critical part of forensic linguistics. While manual authorial impersonation by perpetrators has long been a recognized threat in historical forensic cases, recent advances in large language models (LLMs) raise new challenges, as adversaries may exploit these tools to impersonate another's writing. This study investigates whether prompted LLMs can generate convincing authorial impersonations and whether such outputs can evade existing forensic AV systems. Using GPT-4o as the adversary model, we generated impersonation texts under four prompting conditions across three genres: emails, text messages, and social media posts. We then evaluated these outputs against both non-neural AV methods (n-gram tracing, Ranking-Based Impostors Method, LambdaG) and neural approaches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
