RADAR: Robust AI-Text Detection via Adversarial Learning
Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho

TL;DR
RADAR introduces an adversarial training framework to enhance AI-text detection robustness against paraphrasing, significantly outperforming existing methods across multiple models and datasets.
Contribution
The paper proposes a novel adversarial learning framework called RADAR that jointly trains a paraphraser and detector to improve robustness of AI-text detection.
Findings
RADAR outperforms existing detection methods on 8 LLMs and 4 datasets.
RADAR maintains strong transferability to unseen LLMs like GPT-3.5-Turbo.
Adversarial training enhances detection robustness against paraphrasing attacks.
Abstract
Recent advances in large language models (LLMs) and the intensifying popularity of ChatGPT-like applications have blurred the boundary of high-quality text generation between humans and machines. However, in addition to the anticipated revolutionary changes to our technology and society, the difficulty of distinguishing LLM-generated texts (AI-text) from human-generated texts poses new challenges of misuse and fairness, such as fake content generation, plagiarism, and false accusations of innocent writers. While existing works show that current AI-text detectors are not robust to LLM-based paraphrasing, this paper aims to bridge this gap by proposing a new framework called RADAR, which jointly trains a robust AI-text detector via adversarial learning. RADAR is based on adversarial training of a paraphraser and a detector. The paraphraser's goal is to generate realistic content to evade…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Cosine Annealing · Weight Decay · 15 Ways to Contact How can i speak to someone at Delta Airlines · Residual Connection
