Fine-Tuning Language Models on Multiple Datasets for Citation Intention   Classification

Zeren Shui; Petros Karypis; Daniel S. Karls; Mingjian Wen; Saurav; Manchanda; Ellad B. Tadmor; George Karypis

arXiv:2410.13332·cs.CL·October 18, 2024

Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification

Zeren Shui, Petros Karypis, Daniel S. Karls, Mingjian Wen, Saurav, Manchanda, Ellad B. Tadmor, George Karypis

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a multi-task learning framework with task relation learning to improve citation intention classification by fine-tuning language models on multiple datasets, enhancing generalization especially on small datasets.

Contribution

It proposes a novel multi-task learning approach with data-driven task relation learning to effectively leverage auxiliary datasets during fine-tuning of PLMs for CIC.

Findings

01

Improved performance on small CIC datasets by 7-11%.

02

Outperforms current state-of-the-art models.

03

Enhances generalization through auxiliary dataset integration.

Abstract

Citation intention Classification (CIC) tools classify citations by their intention (e.g., background, motivation) and assist readers in evaluating the contribution of scientific literature. Prior research has shown that pretrained language models (PLMs) such as SciBERT can achieve state-of-the-art performance on CIC benchmarks. PLMs are trained via self-supervision tasks on a large corpus of general text and can quickly adapt to CIC tasks via moderate fine-tuning on the corresponding dataset. Despite their advantages, PLMs can easily overfit small datasets during fine-tuning. In this paper, we propose a multi-task learning (MTL) framework that jointly fine-tunes PLMs on a dataset of primary interest together with multiple auxiliary CIC datasets to take advantage of additional supervision signals. We develop a data-driven task relation learning (TRL) method that controls the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shuix007/deep-citation
pytorchOfficial

Videos

Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification· underline

Taxonomy

TopicsTopic Modeling