Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

Rounak Saha; Gurusha Juneja; Dayita Chaudhuri; Naveeja Sajeevan; Nihar B Shah; Danish Pruthi

arXiv:2603.20450·cs.CL·March 24, 2026

Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

Rounak Saha, Gurusha Juneja, Dayita Chaudhuri, Naveeja Sajeevan, Nihar B Shah, Danish Pruthi

PDF

Open Access

TL;DR

This study evaluates the enforceability of policies banning LLM use in peer reviews, revealing that current detectors often misclassify human-AI collaborations, thus questioning the reliability of AI detection in academic peer review contexts.

Contribution

The paper systematically assesses the effectiveness of state-of-the-art AI detectors on simulated peer reviews, highlighting their limitations and the challenges in reliably identifying AI-assisted reviews.

Findings

01

All detectors misclassify some human-AI collaborative reviews as AI-generated.

02

Incorporating peer-review-specific signals offers limited detection improvements.

03

Current detectors are unreliable for accurately identifying AI use in peer reviews.

Abstract

A number of scientific conferences and journals have recently enacted policies that prohibit LLM usage by peer reviewers, except for polishing, paraphrasing, and grammar correction of otherwise human-written reviews. But, are these policies enforceable? To answer this question, we assemble a dataset of peer reviews simulating multiple levels of human-AI collaboration, and evaluate five state-of-the-art detectors, including two commercial systems. Our analysis shows that all detectors misclassify a non-trivial fraction of LLM-polished reviews as AI-generated, thereby risking false accusations of academic misconduct. We further investigate whether peer-review-specific signals, including access to the paper manuscript and the constrained domain of scientific writing, can be leveraged to improve detection. While incorporating such signals yields measurable gains in some settings, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAcademic integrity and plagiarism · Academic Publishing and Open Access · scientometrics and bibliometrics research