Watermark in the Classroom: A Conformal Framework for Adaptive AI Usage Detection
Yangxinyu Xie, Xuyang Chen, Zhimei Ren, Weijie J. Su

TL;DR
This paper introduces a conformal watermarking framework for AI usage detection in education, effectively controlling false positive rates and accommodating diverse student groups and AI editing levels.
Contribution
It adapts watermarking detection methods with conformal techniques for classroom settings, addressing bias and false positives in AI detection.
Findings
Effective FPR control across diverse classroom scenarios
Robust detection across multiple AI editing levels
Quantitative framework for academic integrity enforcement
Abstract
As artificial intelligence tools become ubiquitous in education, maintaining academic integrity while accommodating pedagogically beneficial AI assistance presents unprecedented challenges. Current AI detection systems fail to control false positive rates (FPR) and suffer from bias against minority student groups, prompting institutional suspensions of these technologies. Watermarking techniques offer statistical rigor through precise -values but remain untested in educational contexts where students may use varying levels of permitted AI edits. We present the first adaptation of watermarking-based detection methods for classroom settings, introducing conformal methods that effectively control FPR across diverse classroom settings. Using essays from native and non-native English speakers, we simulate seven levels of AI editing interventions--from grammar correction to content…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Digital and Cyber Forensics · Advanced Steganography and Watermarking Techniques
