Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification
Zeren Shui, Petros Karypis, Daniel S. Karls, Mingjian Wen, Saurav, Manchanda, Ellad B. Tadmor, George Karypis

TL;DR
This paper introduces a multi-task learning framework with task relation learning to improve citation intention classification by fine-tuning language models on multiple datasets, enhancing generalization especially on small datasets.
Contribution
It proposes a novel multi-task learning approach with data-driven task relation learning to effectively leverage auxiliary datasets during fine-tuning of PLMs for CIC.
Findings
Improved performance on small CIC datasets by 7-11%.
Outperforms current state-of-the-art models.
Enhances generalization through auxiliary dataset integration.
Abstract
Citation intention Classification (CIC) tools classify citations by their intention (e.g., background, motivation) and assist readers in evaluating the contribution of scientific literature. Prior research has shown that pretrained language models (PLMs) such as SciBERT can achieve state-of-the-art performance on CIC benchmarks. PLMs are trained via self-supervision tasks on a large corpus of general text and can quickly adapt to CIC tasks via moderate fine-tuning on the corresponding dataset. Despite their advantages, PLMs can easily overfit small datasets during fine-tuning. In this paper, we propose a multi-task learning (MTL) framework that jointly fine-tunes PLMs on a dataset of primary interest together with multiple auxiliary CIC datasets to take advantage of additional supervision signals. We develop a data-driven task relation learning (TRL) method that controls the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling
