Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
Jiaqi Chen, Xiaoye Zhu, Tianyang Liu, Ying Chen, Xinhui Chen, Yiwen, Yuan, Chak Tou Leong, Zuchao Li, Tang Long, Lei Zhang, Chenyu Yan, Guanghao, Mei, Jie Zhang, Lefei Zhang

TL;DR
This paper introduces Imitate Before Detect (ImBD), a novel method that aligns a language model's style preferences to detect machine-revised text by comparing style distributions, significantly improving detection accuracy across various scenarios.
Contribution
The paper proposes a new style alignment approach, Style Preference Optimization (SPO), to effectively detect machine-revised text by modeling machine-style token distributions.
Findings
13% increase in AUC for open-source LLM revisions
5% improvement in GPT-3.5 revision detection
Surpasses GPT-Zero with minimal samples and time
Abstract
Large Language Models (LLMs) have revolutionized text generation, making detecting machine-generated text increasingly challenging. Although past methods have achieved good performance on detecting pure machine-generated text, those detectors have poor performance on distinguishing machine-revised text (rewriting, expansion, and polishing), which can have only minor changes from its original human prompt. As the content of text may originate from human prompts, detecting machine-revised text often involves identifying distinctive machine styles, e.g., worded favored by LLMs. However, existing methods struggle to detect machine-style phrasing hidden within the content contributed by humans. We propose the "Imitate Before Detect" (ImBD) approach, which first imitates the machine-style token distribution, and then compares the distribution of the text to be tested with the machine-style…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Handwritten Text Recognition Techniques · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Cosine Annealing · Residual Connection · Linear Layer · Linear Warmup With Cosine Annealing · Weight Decay · Softmax · Attention Dropout · Dropout
