ACM-CR: A Manually Annotated Test Collection for Citation Recommendation

Florian Boudin

arXiv:2108.07571 · cs.IR · August 18, 2021

TL;DR

This paper introduces ACM-CR, a manually annotated test collection for citation recommendation. It is intended to make evaluation more reliable than existing collections, which are built automatically from parsed PDF papers and are therefore noisy.

Contribution

The paper presents a publicly available, manually annotated test collection for citation recommendation and reports results for content-based baseline models as reference points for future work.

Findings

01 Baseline models show moderate effectiveness on ACM-CR

02 Manual annotation improves dataset reliability

03 The dataset and code are openly available

Abstract

Citation recommendation is intended to assist researchers in the process of searching for relevant papers to cite by recommending appropriate citations for a given input text. Existing test collections for this task are noisy and unreliable since they are built automatically from parsed PDF papers. In this paper, we present our ongoing effort at creating a publicly available, manually annotated test collection for citation recommendation. We also conduct a series of experiments to evaluate the effectiveness of content-based baseline models on the test collection, providing results for future work to improve upon. Our test collection and code to replicate experiments are available at https://github.com/boudinfl/acm-cr
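To make the task concrete, here is a minimal sketch of what a content-based baseline could look like: candidate papers are ranked by TF-IDF cosine similarity to the input text, and the ranking is scored with recall at k. This is an illustration under stated assumptions, not the paper's method. The abstract does not name the baselines it evaluates, and the paper texts, identifiers, and function names below are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it on non-alphanumeric characters."""
    return re.findall(r"[a-z0-9]+", text.lower())


def vectorize(tokens, idf):
    """Build a sparse TF-IDF vector; terms unseen in the collection are dropped."""
    tf = Counter(t for t in tokens if t in idf)
    return {t: count * idf[t] for t, count in tf.items()}


def build_index(papers):
    """Compute smoothed IDF weights and a TF-IDF vector per candidate paper."""
    tokens = {pid: tokenize(text) for pid, text in papers.items()}
    df = Counter(term for toks in tokens.values() for term in set(toks))
    n = len(papers)
    idf = {term: math.log((1 + n) / (1 + d)) + 1.0 for term, d in df.items()}
    vectors = {pid: vectorize(toks, idf) for pid, toks in tokens.items()}
    return idf, vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def recommend(context, idf, vectors, k=10):
    """Return the ids of the k candidates most similar to the input text."""
    query = vectorize(tokenize(context), idf)
    ranked = sorted(vectors, key=lambda pid: cosine(query, vectors[pid]), reverse=True)
    return ranked[:k]


def recall_at_k(ranked, relevant):
    """Fraction of the relevant papers that appear in the ranked list."""
    return len(set(ranked) & set(relevant)) / len(relevant) if relevant else 0.0


# Hypothetical toy collection and input text (not taken from ACM-CR).
PAPERS = {
    "p1": "neural ranking models for document retrieval",
    "p2": "keyphrase extraction from scientific articles",
    "p3": "protein folding with deep learning",
}
CONTEXT = "we rank candidate documents with a neural retrieval model"

idf, vectors = build_index(PAPERS)
top = recommend(CONTEXT, idf, vectors, k=1)
```

On a real test collection, the same loop would run over every annotated input text, with the manually annotated citations serving as the relevant set passed to `recall_at_k`.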

Code & Models

Repositories

boudinfl/acm-cr (Official)

Taxonomy

Topics: Recommender Systems and Techniques · Expert Finding and Q&A Systems · Topic Modeling