Fine-Tuning Pretrained Language Models With Label Attention for   Biomedical Text Classification

Bruce Nguyen; Shaoxiong Ji

arXiv:2108.11809·cs.CL·March 8, 2022

Fine-Tuning Pretrained Language Models With Label Attention for Biomedical Text Classification

Bruce Nguyen, Shaoxiong Ji

PDF

Open Access

TL;DR

This paper introduces a transformer-based biomedical text classifier that incorporates label descriptions through a label attention module during fine-tuning, improving classification performance on medical datasets.

Contribution

It presents a novel label attention mechanism integrated into pretrained language models for biomedical text classification, leveraging label descriptions for better accuracy.

Findings

01

Outperforms vanilla PTMs on two public datasets

02

Achieves state-of-the-art results in biomedical text classification

03

Demonstrates the effectiveness of label-aware fine-tuning

Abstract

The massive scale and growth of textual biomedical data have made its indexing and classification increasingly important. However, existing research on this topic mainly utilized convolutional and recurrent neural networks, which generally achieve inferior performance than the novel transformers. On the other hand, systems that apply transformers only focus on the target documents, overlooking the rich semantic information that label descriptions contain. To address this gap, we develop a transformer-based biomedical text classifier that considers label information. The system achieves this with a label attention module incorporated into the fine-tuning process of pretrained language models (PTMs). Our results on two public medical datasets show that the proposed fine-tuning scheme outperforms the vanilla PTMs and state-of-the-art models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Text and Document Classification Technologies