TL;DR
FactReview is an evidence-grounded peer review system that extracts claims, positions literature, verifies claims through code execution, and produces detailed, label-based review reports.
Contribution
The paper introduces FactReview, a novel system combining claim extraction, literature positioning, and execution-based verification to enhance AI-assisted peer review.
Findings
FactReview reproduces results closely matching original claims.
It identifies when broader performance claims are not fully supported.
The system provides evidence-based labels for each major claim.
Abstract
Peer review in machine learning is under growing pressure from rising submission volume and limited reviewer time. Most LLM-based reviewing systems read only the manuscript and generate comments from the paper's own narrative. This makes their outputs sensitive to presentation quality and leaves them weak when the evidence needed for review lies in related work or released code. We present FactReview, an evidence-grounded reviewing system that combines claim extraction, literature positioning, and execution-based claim verification. Given a submission, FactReview identifies major claims and reported results, retrieves nearby work to clarify the paper's technical position, and, when code is available, executes the released repository under bounded budgets to test central empirical claims. It then produces a concise review and an evidence report that assigns each major claim one of five…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
