Can You Detect the Difference?

\.Ismail Tar{\i}m; Aytu\u{g} Onan

arXiv:2507.10475·cs.CL·July 15, 2025

Can You Detect the Difference?

\.Ismail Tar{\i}m, Aytu\u{g} Onan

PDF

Open Access

TL;DR

This paper compares diffusion-based and autoregressive language models, revealing that diffusion models closely mimic human text and pose challenges for existing detection methods, emphasizing the need for diffusion-aware detection techniques.

Contribution

It provides the first systematic comparison of diffusion-generated and AR-generated texts, highlighting limitations of current detection metrics and proposing new directions for diffusion-aware detection.

Findings

01

LLaDA mimics human text closely, causing high false negatives.

02

LLaMA has lower perplexity but less lexical fidelity.

03

Single metrics are insufficient for detection.

Abstract

The rapid advancement of large language models (LLMs) has raised concerns about reliably detecting AI-generated text. Stylometric metrics work well on autoregressive (AR) outputs, but their effectiveness on diffusion-based models is unknown. We present the first systematic comparison of diffusion-generated text (LLaDA) and AR-generated text (LLaMA) using 2 000 samples. Perplexity, burstiness, lexical diversity, readability, and BLEU/ROUGE scores show that LLaDA closely mimics human text in perplexity and burstiness, yielding high false-negative rates for AR-oriented detectors. LLaMA shows much lower perplexity but reduced lexical fidelity. Relying on any single metric fails to separate diffusion outputs from human writing. We highlight the need for diffusion-aware detectors and outline directions such as hybrid models, diffusion-specific stylometric signatures, and robust watermarking.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Text Readability and Simplification · Hate Speech and Cyberbullying Detection