'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews

Sandeep Kumar; Mohit Sahu; Vardhan Gacche; Tirthankar Ghosal; Asif Ekbal

arXiv:2410.09770 · cs.CL · October 15, 2024

TL;DR

This paper introduces two models, Term Frequency (TF) and Review Regeneration (RR), for detecting AI-generated peer reviews. It shows that both are effective and robust against attacks, which helps editors maintain review integrity.

Contribution

The paper presents novel detection models specifically designed for identifying AI-generated peer reviews, addressing real-world challenges and improving over existing generic detectors.

Findings

01. The RR model is more robust against paraphrasing attacks.

02. Both models outperform existing AI text detectors.

03. The TF model performs better when no attack is applied.

Abstract

The integrity of the peer-review process is vital for maintaining scientific rigor and trust within the academic community. With the steady increase in the usage of large language models (LLMs) like ChatGPT in academic writing, there is a growing concern that AI-generated texts could compromise scientific publishing, including peer-reviews. Previous works have focused on generic AI-generated text detection or have presented an approach for estimating the fraction of peer-reviews that can be AI-generated. Our focus here is to solve a real-world problem by assisting the editor or chair in determining whether a review is written by ChatGPT or not. To address this, we introduce the Term Frequency (TF) model, which posits that AI often repeats tokens, and the Review Regeneration (RR) model, which is based on the idea that ChatGPT generates similar outputs upon re-prompting. We stress test…
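The two intuitions in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`tokenize`, `repetition_score`, `regeneration_similarity`), the regex tokenizer, and the bag-of-words cosine similarity are all assumptions chosen for illustration. The TF-style score measures how heavily a review leans on tokens that are frequent in a reference set of AI-generated reviews, and the RR-style score compares a review with a regenerated review for the same paper. The regeneration step itself (re-prompting the LLM) is left out; the regenerated text is assumed to be supplied as a string.

```python
from collections import Counter
import math
import re


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens (a simplifying assumption)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def repetition_score(review: str, reference_counts: Counter) -> float:
    """TF-style idea (hypothetical scoring): mean relative frequency, in a
    reference set of AI-generated reviews, of the tokens used in `review`."""
    tokens = tokenize(review)
    total = sum(reference_counts.values())
    if not tokens or total == 0:
        return 0.0
    return sum(reference_counts[t] for t in tokens) / (len(tokens) * total)


def regeneration_similarity(review: str, regenerated: str) -> float:
    """RR-style idea (hypothetical scoring): cosine similarity between the
    token-count vectors of a review and a regenerated review."""
    a, b = Counter(tokenize(review)), Counter(tokenize(regenerated))
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0
```

Under these assumptions, a higher value from either function would count as weak evidence that a review is AI-generated. Any decision threshold would have to be tuned on labeled data, and the paper's actual models may differ substantially from this sketch.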

Code & Models

Repositories

sandeep82945/ai-review-detection (PyTorch, official)

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education

Methods: Focus