ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents

Zhuofeng Li; Yi Lu; Dongfu Jiang; Haoxiang Zhang; Yuyang Bai; Chuan Li; Yu Wang; Shuiwang Ji; Jianwen Xie; Yu Zhang

arXiv:2604.14261·cs.CL·April 17, 2026

ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents

Zhuofeng Li, Yi Lu, Dongfu Jiang, Haoxiang Zhang, Yuyang Bai, Chuan Li, Yu Wang, Shuiwang Ji, Jianwen Xie, Yu Zhang

PDF

1 Repo

TL;DR

ReviewGrounder enhances AI-generated peer reviews by integrating explicit rubrics and contextual grounding, leading to more substantive, evidence-based feedback that aligns better with human judgments.

Contribution

It introduces REVIEWBENCH for evaluating reviews and REVIEWGROUNDER, a multi-agent framework that improves review quality through rubric-guided drafting and grounding stages.

Findings

01

REVIEWGROUNDER outperforms baselines in review quality metrics.

02

The framework shows improved alignment with human judgments.

03

Using larger models further enhances review quality.

Abstract

The rapid rise in AI conference submissions has driven increasing exploration of large language models (LLMs) for peer review support. However, LLM-based reviewers often generate superficial, formulaic comments lacking substantive, evidence-grounded feedback. We attribute this to the underutilization of two key components of human reviewing: explicit rubrics and contextual grounding in existing work. To address this, we introduce REVIEWBENCH, a benchmark evaluating review text according to paper-specific rubrics derived from official guidelines, the paper's content, and human-written reviews. We further propose REVIEWGROUNDER, a rubric-guided, tool-integrated multi-agent framework that decomposes reviewing into drafting and grounding stages, enriching shallow drafts via targeted evidence consolidation. Experiments on REVIEWBENCH show that REVIEWGROUNDER, using a Phi-4-14B-based drafter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EigenTom/ReviewGrounder
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.