DeepReviewer 2.0: A Traceable Agentic System for Auditable Scientific Peer Review
Yixuan Weng, Minjun Zhu, Qiujie Xie, Zhiyuan Ning, Shichen Li, Panzhong Lu, Zhen Lin, Enhao Gu, Qiyao Sun, Yue Zhang

TL;DR
DeepReviewer 2.0 is an advanced, traceable AI system designed for transparent and auditable scientific peer reviews, outperforming some existing models in coverage and comparison tests.
Contribution
It introduces a process-controlled, traceable review system that produces audit-friendly critique packages with anchored evidence and follow-up actions.
Findings
Outperforms Gemini-3.1-Pro in coverage and blind comparison tests.
Achieves 71.63% win rate against human reviewers in blind evaluations.
Builds a claim-evidence-risk ledger and verification agenda for transparent reviews.
Abstract
Automated peer review is often framed as generating fluent critique, yet reviewers and area chairs need judgments they can \emph{audit}: where a concern applies, what evidence supports it, and what concrete follow-up is required. DeepReviewer~2.0 is a process-controlled agentic review system built around an output contract: it produces a \textbf{traceable review package} with anchored annotations, localized evidence, and executable follow-up actions, and it exports only after meeting minimum traceability and coverage budgets. Concretely, it first builds a manuscript-only claim--evidence--risk ledger and verification agenda, then performs agenda-driven retrieval and writes anchored critiques under an export gate. On 134 ICLR~2025 submissions under three fixed protocols, an \emph{un-finetuned 196B} model running DeepReviewer~2.0 outperforms Gemini-3.1-Pro-preview, improving strict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
