TL;DR
ReviewRL is a reinforcement learning framework that automates scientific paper reviews by integrating literature, fine-tuning, and reward-based optimization, significantly improving review quality and accuracy.
Contribution
It introduces a novel RL-based approach combining literature retrieval, supervised fine-tuning, and reward optimization for automated scientific review generation.
Findings
Outperforms existing automated review methods on multiple metrics.
Effectively incorporates relevant scientific literature into reviews.
Shows promising potential for future development in automated scientific critique.
Abstract
Peer review is essential for scientific progress but faces growing challenges due to increasing submission volumes and reviewer fatigue. Existing automated review approaches struggle with factual accuracy, rating consistency, and analytical depth, often generating superficial or generic feedback lacking the insights characteristic of high-quality human reviews. We introduce ReviewRL, a reinforcement learning framework for generating comprehensive and factually grounded scientific paper reviews. Our approach combines: (1) an ArXiv-MCP retrieval-augmented context generation pipeline that incorporates relevant scientific literature, (2) supervised fine-tuning that establishes foundational reviewing capabilities, and (3) a reinforcement learning procedure with a composite reward function that jointly enhances review quality and rating accuracy. Experiments on ICLR 2025 papers demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
