Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable
Rounak Saha, Gurusha Juneja, Dayita Chaudhuri, Naveeja Sajeevan, Nihar B Shah, Danish Pruthi

TL;DR
This study evaluates the enforceability of policies banning LLM use in peer reviews, revealing that current detectors often misclassify human-AI collaborations, thus questioning the reliability of AI detection in academic peer review contexts.
Contribution
The paper systematically assesses the effectiveness of state-of-the-art AI detectors on simulated peer reviews, highlighting their limitations and the challenges in reliably identifying AI-assisted reviews.
Findings
All detectors misclassify some human-AI collaborative reviews as AI-generated.
Incorporating peer-review-specific signals offers limited detection improvements.
Current detectors are unreliable for accurately identifying AI use in peer reviews.
Abstract
A number of scientific conferences and journals have recently enacted policies that prohibit LLM usage by peer reviewers, except for polishing, paraphrasing, and grammar correction of otherwise human-written reviews. But, are these policies enforceable? To answer this question, we assemble a dataset of peer reviews simulating multiple levels of human-AI collaboration, and evaluate five state-of-the-art detectors, including two commercial systems. Our analysis shows that all detectors misclassify a non-trivial fraction of LLM-polished reviews as AI-generated, thereby risking false accusations of academic misconduct. We further investigate whether peer-review-specific signals, including access to the paper manuscript and the constrained domain of scientific writing, can be leveraged to improve detection. While incorporating such signals yields measurable gains in some settings, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcademic integrity and plagiarism · Academic Publishing and Open Access · scientometrics and bibliometrics research
