RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

Sihong Wu; Yiling Ma; Yilun Zhao; Tiansheng Hu; Owen Jiang; Manasi Patwardhan; Arman Cohan

arXiv:2603.09723·cs.CL·April 29, 2026

RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

Sihong Wu, Yiling Ma, Yilun Zhao, Tiansheng Hu, Owen Jiang, Manasi Patwardhan, Arman Cohan

PDF

1 Repo 1 Datasets

TL;DR

This paper introduces RbtAct, a method leveraging peer review rebuttals as implicit supervision to generate more actionable and specific review feedback, supported by a large dataset and improved training techniques.

Contribution

It proposes a new task and dataset for review feedback generation, and demonstrates a fine-tuned LLM that produces more actionable review comments.

Findings

01

Enhanced feedback actionability and specificity over baselines.

02

Built RMR-75K dataset linking review segments to rebuttals.

03

Model maintains relevance and grounding while improving feedback quality.

Abstract

Large language models (LLMs) are increasingly used across the scientific workflow, including to draft peer-review reports. However, many AI-generated reviews are superficial and insufficiently actionable, leaving authors without concrete, implementable guidance and motivating the gap this work addresses. We propose RbtAct, which targets actionable review feedback generation and places existing peer review rebuttal at the center of learning. Rebuttals show which reviewer comments led to concrete revisions or specific plans, and which were only defended. Building on this insight, we leverage rebuttal as implicit supervision to directly optimize a feedback generator for actionability. To support this objective, we propose a new task called perspective-conditioned segment-level review feedback generation, in which the model is required to produce a single focused comment based on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

formula12/RbtAct
github

Datasets

shwu22/RMR-75K
dataset· 62 dl
62 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.