Re2: A Consistency-ensured Dataset for Full-stage Peer Review and Multi-turn Rebuttal Discussions

Daoze Zhang; Zhijian Bao; Sihang Du; Zhiyi Zhao; Kuangling Zhang; Dezheng Bao; Yang Yang

arXiv:2505.07920·cs.CL·March 16, 2026

Re2: A Consistency-ensured Dataset for Full-stage Peer Review and Multi-turn Rebuttal Discussions

Daoze Zhang, Zhijian Bao, Sihang Du, Zhiyi Zhao, Kuangling Zhang, Dezheng Bao, Yang Yang

PDF

1 Datasets

TL;DR

The paper introduces Re2, a large, consistency-ensured peer review and rebuttal dataset designed to improve AI-assisted review processes and support multi-turn discussions for manuscript refinement.

Contribution

It presents the largest peer review dataset with consistency checks, supporting both static review tasks and dynamic LLM-assisted rebuttal interactions.

Findings

01

Largest peer review dataset with nearly 20,000 submissions

02

Supports multi-turn rebuttal and review interactions

03

Enhances data quality for AI review assistance

Abstract

Peer review is a critical component of scientific progress in the fields like AI, but the rapid increase in submission volume has strained the reviewing system, which inevitably leads to reviewer shortages and declines review quality. Besides the growing research popularity, another key factor in this overload is the repeated resubmission of substandard manuscripts, largely due to the lack of effective tools for authors to self-evaluate their work before submission. Large Language Models (LLMs) show great promise in assisting both authors and reviewers, and their performance is fundamentally limited by the quality of the peer review data. However, existing peer review datasets face three major limitations: (1) limited data diversity, (2) inconsistent and low-quality data due to the use of revised rather than initial submissions, and (3) insufficient support for tasks involving rebuttal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Daoze/ReviewRebuttal
dataset· 225 dl
225 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.