Imitate Before Detect: Aligning Machine Stylistic Preference for   Machine-Revised Text Detection

Jiaqi Chen; Xiaoye Zhu; Tianyang Liu; Ying Chen; Xinhui Chen; Yiwen; Yuan; Chak Tou Leong; Zuchao Li; Tang Long; Lei Zhang; Chenyu Yan; Guanghao; Mei; Jie Zhang; Lefei Zhang

arXiv:2412.10432·cs.CL·December 24, 2024

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection

Jiaqi Chen, Xiaoye Zhu, Tianyang Liu, Ying Chen, Xinhui Chen, Yiwen, Yuan, Chak Tou Leong, Zuchao Li, Tang Long, Lei Zhang, Chenyu Yan, Guanghao, Mei, Jie Zhang, Lefei Zhang

PDF

Open Access 1 Video

TL;DR

This paper introduces Imitate Before Detect (ImBD), a novel method that aligns a language model's style preferences to detect machine-revised text by comparing style distributions, significantly improving detection accuracy across various scenarios.

Contribution

The paper proposes a new style alignment approach, Style Preference Optimization (SPO), to effectively detect machine-revised text by modeling machine-style token distributions.

Findings

01

13% increase in AUC for open-source LLM revisions

02

5% improvement in GPT-3.5 revision detection

03

Surpasses GPT-Zero with minimal samples and time

Abstract

Large Language Models (LLMs) have revolutionized text generation, making detecting machine-generated text increasingly challenging. Although past methods have achieved good performance on detecting pure machine-generated text, those detectors have poor performance on distinguishing machine-revised text (rewriting, expansion, and polishing), which can have only minor changes from its original human prompt. As the content of text may originate from human prompts, detecting machine-revised text often involves identifying distinctive machine styles, e.g., worded favored by LLMs. However, existing methods struggle to detect machine-style phrasing hidden within the content contributed by humans. We propose the "Imitate Before Detect" (ImBD) approach, which first imitates the machine-style token distribution, and then compares the distribution of the text to be tested with the machine-style…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection· underline

Taxonomy

TopicsNatural Language Processing Techniques · Handwritten Text Recognition Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Cosine Annealing · Residual Connection · Linear Layer · Linear Warmup With Cosine Annealing · Weight Decay · Softmax · Attention Dropout · Dropout