SpecDetect: Simple, Fast, and Training-Free Detection of LLM-Generated Text via Spectral Analysis
Haitong Luo, Weiyao Zhang, Suhang Wang, Wenji Zou, Chungang Lin, Xuying Meng, Yujun Zhang

TL;DR
This paper introduces SpecDetect, a fast, training-free method for detecting LLM-generated text by analyzing spectral energy in token probability signals, outperforming existing methods.
Contribution
The paper presents a novel spectral analysis approach to detect LLM-generated text, emphasizing signal energy differences, and introduces SpecDetect++ with enhanced robustness.
Findings
Spectral energy is higher in human-written text than LLM-generated text.
SpecDetect outperforms state-of-the-art detectors in accuracy.
The method runs nearly twice as fast as existing approaches.
Abstract
The proliferation of high-quality text from Large Language Models (LLMs) demands reliable and efficient detection methods. While existing training-free approaches show promise, they often rely on surface-level statistics and overlook fundamental signal properties of the text generation process. In this work, we reframe detection as a signal processing problem, introducing a novel paradigm that analyzes the sequence of token log-probabilities in the frequency domain. By systematically analyzing the signal's spectral properties using the global Discrete Fourier Transform (DFT) and the local Short-Time Fourier Transform (STFT), we find that human-written text consistently exhibits significantly higher spectral energy. This higher energy reflects the larger-amplitude fluctuations inherent in human writing compared to the suppressed dynamics of LLM-generated text. Based on this key insight,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques
