CiteFusion: An Ensemble Framework for Citation Intent Classification Harnessing Dual-Model Binary Couples and SHAP Analyses
Lorenzo Paolini, Sahar Vahdati, Angelo Di Iorio, Robert Wardenga, Ivan Heibi, Silvio Peroni

TL;DR
CiteFusion is an ensemble framework that combines dual-model binary classifiers and SHAP analyses to improve citation intent classification accuracy and interpretability on scholarly datasets.
Contribution
This work introduces CiteFusion, a novel ensemble approach using binary decomposition, dual models, and interpretability techniques for citation intent classification.
Findings
Achieves state-of-the-art Macro-F1 scores of 89.60% on SciCite
Demonstrates robustness in imbalanced and data-scarce scenarios
Provides interpretability through SHAP analyses and structural context incorporation
Abstract
Understanding the motivations underlying scholarly citations is essential to evaluate research impact and promote transparent scholarly communication. This study introduces CiteFusion, an ensemble framework designed to address the multi-class Citation Intent Classification task on two benchmark datasets: SciCite and ACL-ARC. The framework employs a one-vs-all decomposition of the multi-class task into class-specific binary subtasks, leveraging complementary pairs of SciBERT and XLNet models, independently tuned, for each citation intent. The outputs of these base models are aggregated through a feedforward neural network meta-classifier to reconstruct the original classification task. To enhance interpretability, SHAP (SHapley Additive exPlanations) is employed to analyze token-level contributions, and interactions among base models, providing transparency into the classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Residual Connection · Byte Pair Encoding · Layer Normalization · Linear Layer · Linear Warmup With Linear Decay · Adam · Dropout · SentencePiece
