WarrantScore: Modeling Warrants between Claims and Evidence for Substantiation Evaluation in Peer Reviews
Kiyotada Mori, Shohei Tanaka, Tosho Hirasawa, Tadashi Kozuno, Koichiro Yoshino, Yoshitaka Ushiku

TL;DR
This paper introduces WarrantScore, a new metric that evaluates the logical inference between claims and evidence in peer reviews, aiming to improve the interpretability and accuracy of substantiation assessment in scientific review comments.
Contribution
It proposes a novel evaluation metric that assesses logical inference between claims and evidence, enhancing the interpretability and correlation with human judgments in peer review analysis.
Findings
Achieves higher correlation with human scores than conventional methods
Improves assessment of logical inference in scientific reviews
Supports more efficient peer-review processes
Abstract
The scientific peer-review process is facing a shortage of human resources due to the rapid growth in the number of submitted papers. The use of language models to reduce the human cost of peer review has been actively explored as a potential solution to this challenge. A method has been proposed to evaluate the level of substantiation in scientific reviews in a manner that is interpretable by humans. This method extracts the core components of an argument, claims and evidence, and assesses the level of substantiation based on the proportion of claims supported by evidence. The level of substantiation refers to the extent to which claims are based on objective facts. However, when assessing the level of substantiation, simply detecting the presence or absence of supporting evidence for a claim is insufficient; it is also necessary to accurately assess the logical inference between a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExpert finding and Q&A systems · scientometrics and bibliometrics research · Academic integrity and plagiarism
