Local Citation Recommendation with Hierarchical-Attention Text Encoder   and SciBERT-based Reranking

Nianlong Gu; Yingqiang Gao; Richard H.R. Hahnloser

arXiv:2112.01206·cs.IR·March 18, 2022·1 cites

Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking

Nianlong Gu, Yingqiang Gao, Richard H.R. Hahnloser

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical attention text encoder combined with SciBERT-based reranking for efficient and accurate local citation recommendation, outperforming existing methods on multiple datasets.

Contribution

It proposes a novel hierarchical attention encoder for prefetching and integrates it with SciBERT reranking, achieving state-of-the-art results in local citation recommendation.

Findings

01

High prefetch recall with hierarchical attention encoder

02

Fewer candidates needed for reranking

03

State-of-the-art performance on multiple datasets

Abstract

The goal of local citation recommendation is to recommend a missing reference from the local citation context and optionally also from the global context. To balance the tradeoff between speed and accuracy of citation recommendation in the context of a large-scale paper database, a viable approach is to first prefetch a limited number of relevant documents using efficient ranking methods and then to perform a fine-grained reranking using more sophisticated models. In that vein, BM25 has been found to be a tough-to-beat approach to prefetching, which is why recent work has focused mainly on the reranking step. Even so, we explore prefetching with nearest neighbor search among text embeddings constructed by a hierarchical attention network. When coupled with a SciBERT reranker fine-tuned on local citation recommendation tasks, our hierarchical Attention encoder (HAtten) achieves high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nianlonggu/Local-Citation-Recommendation
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Text and Document Classification Technologies

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings