Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking
Nianlong Gu, Yingqiang Gao, Richard H.R. Hahnloser

TL;DR
This paper introduces a hierarchical attention text encoder combined with SciBERT-based reranking for efficient and accurate local citation recommendation, outperforming existing methods on multiple datasets.
Contribution
It proposes a novel hierarchical attention encoder for prefetching and integrates it with SciBERT reranking, achieving state-of-the-art results in local citation recommendation.
Findings
High prefetch recall with hierarchical attention encoder
Fewer candidates needed for reranking
State-of-the-art performance on multiple datasets
Abstract
The goal of local citation recommendation is to recommend a missing reference from the local citation context and optionally also from the global context. To balance the tradeoff between speed and accuracy of citation recommendation in the context of a large-scale paper database, a viable approach is to first prefetch a limited number of relevant documents using efficient ranking methods and then to perform a fine-grained reranking using more sophisticated models. In that vein, BM25 has been found to be a tough-to-beat approach to prefetching, which is why recent work has focused mainly on the reranking step. Even so, we explore prefetching with nearest neighbor search among text embeddings constructed by a hierarchical attention network. When coupled with a SciBERT reranker fine-tuned on local citation recommendation tasks, our hierarchical Attention encoder (HAtten) achieves high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Text and Document Classification Technologies
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
