How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection
Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

TL;DR
This study demonstrates that task-oriented constraints in user instructions significantly affect the performance of LLM-generated text detectors, causing high variability and challenging detection accuracy in realistic scenarios.
Contribution
It reveals the impact of natural, task-specific constraints on detection performance and highlights the need to consider such instructions in developing robust detectors.
Findings
Detection performance variance increases with task constraints (up to SD of 14.4 F1-score).
Constraints generally make LLM detection more difficult.
High instruction-following ability of LLMs amplifies the effect of constraints.
Abstract
To combat the misuse of Large Language Models (LLMs), many recent studies have presented LLM-generated-text detectors with promising performance. When users instruct LLMs to generate texts, the instruction can include different constraints depending on the user's need. However, most recent studies do not cover such diverse instruction patterns when creating datasets for LLM detection. In this paper, we reveal that even task-oriented constraints -- constraints that would naturally be included in an instruction and are not related to detection-evasion -- cause existing powerful detectors to have a large variance in detection performance. We focus on student essay writing as a realistic domain and manually create task-oriented constraints based on several factors for essay quality. Our experiments show that the standard deviation (SD) of current detector performance on texts generated by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification · Mathematics, Computing, and Information Processing
MethodsFocus
