Why Did You Not Compare With That? Identifying Papers for Use as   Baselines

Manjot Bedi; Tanisha Pandey; Sumit Bhatia; Tanmoy Chakraborty

arXiv:2201.08089·cs.CL·January 21, 2022

Why Did You Not Compare With That? Identifying Papers for Use as Baselines

Manjot Bedi, Tanisha Pandey, Sumit Bhatia, Tanmoy Chakraborty

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural classification approach to automatically identify baseline papers in scientific articles, addressing the challenge of diverse citation appearances and outperforming existing methods.

Contribution

It presents a new dataset of annotated references and a multi-module attention neural classifier for baseline identification, advancing citation role classification.

Findings

01

The classifier outperforms four state-of-the-art methods.

02

A new dataset of 2,075 papers with annotated references is created.

03

Analysis reveals key challenges in baseline identification.

Abstract

We propose the task of automatically identifying papers used as baselines in a scientific article. We frame the problem as a binary classification task where all the references in a paper are to be classified as either baselines or non-baselines. This is a challenging problem due to the numerous ways in which a baseline reference can appear in a paper. We develop a dataset of $2, 075$ papers from ACL anthology corpus with all their references manually annotated as one of the two classes. We develop a multi-module attention-based neural classifier for the baseline classification task that outperforms four state-of-the-art citation role classification methods when applied to the baseline classification task. We also present an analysis of the errors made by the proposed classifier, eliciting the challenges that make baseline identification a challenging problem.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sumit-research/baseline-search
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Advanced Text Analysis Techniques