Assessing the Quality of Scientific Papers

Roman Vainshtein; Gilad Katz; Bracha Shapira; Lior Rokach

arXiv:1908.04200 · cs.IR · August 13, 2019 · 1 citation

TL;DR

This paper introduces a novel corpus linguistics-based method for assessing the overall quality of scientific papers within a specific field, demonstrated in computer science, and shows it effectively distinguishes high-impact papers from low-impact ones.

Contribution

The paper presents a new domain-specific quality measure and an associated classification method for scientific papers, validated in the computer science domain.

Findings

01. Significant score differences between high and low impact corpora

02. Proposed measure outperforms baseline classifier

03. Method applicable for automated scientific paper assessment

Abstract

A multitude of factors are responsible for the overall quality of scientific papers, including readability, linguistic quality, fluency, semantic complexity, and of course domain-specific technical factors. These factors vary from one field of study to another. In this paper, we propose a measure and method for assessing the overall quality of scientific papers in a particular field of study. We evaluate our method in the computer science domain, but it can be applied to other technical and scientific fields. Our method is based on the corpus linguistics technique, which enables the extraction of required information and knowledge associated with a specific domain. For this purpose, we have created a large corpus consisting of papers from very high impact conferences. First, we analyze this corpus in order to extract rich domain-specific terminology and knowledge. Then we…
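
The first step described in the abstract, extracting domain-specific terminology from a corpus, can be illustrated with a minimal Python sketch. The abstract is truncated and does not specify the actual measure, so everything here is an assumption made for illustration: the frequency-ratio criterion, the `min_ratio` threshold, the reference corpus, and the hypothetical helpers `domain_terms` and `coverage_score` are not taken from the paper.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def domain_terms(domain_docs, reference_docs, min_ratio=2.0):
    """Return terms that are over-represented in the domain corpus.

    Assumption: a term counts as domain-specific when its relative
    frequency in the domain corpus is at least `min_ratio` times its
    add-one smoothed relative frequency in a general reference corpus.
    """
    domain = Counter(t for doc in domain_docs for t in tokenize(doc))
    reference = Counter(t for doc in reference_docs for t in tokenize(doc))
    n_domain = sum(domain.values())
    n_reference = sum(reference.values())
    vocab_size = len(set(domain) | set(reference))

    terms = set()
    for term, count in domain.items():
        p_domain = count / n_domain
        # Add-one smoothing keeps unseen reference terms from dividing by zero.
        p_reference = (reference[term] + 1) / (n_reference + vocab_size)
        if p_domain / p_reference >= min_ratio:
            terms.add(term)
    return terms


def coverage_score(paper_text, terms):
    """Hypothetical score: fraction of a paper's tokens that are domain terms."""
    tokens = tokenize(paper_text)
    if not tokens:
        return 0.0
    return sum(t in terms for t in tokens) / len(tokens)


# Toy corpora, invented for this example only.
high_impact = [
    "neural network training uses gradient descent",
    "gradient descent optimizes the neural network",
]
general = ["the cat sat on the mat", "the dog uses the door"]

terms = domain_terms(high_impact, general)
score = coverage_score("gradient descent on the mat", terms)
```

In this toy setup, common words such as "the" are filtered out because they are frequent in the reference corpus as well, while words concentrated in the domain corpus are kept. A real pipeline would need far larger corpora and a more careful statistic than a raw frequency ratio.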

Taxonomy

Topics: Text Readability and Simplification · Natural Language Processing Techniques · Topic Modeling