Improving Disease Detection from Social Media Text via Self-Augmentation   and Contrastive Learning

Pervaiz Iqbal Khan; Andreas Dengel; Sheraz Ahmed

arXiv:2405.01597·cs.CL·May 6, 2024

Improving Disease Detection from Social Media Text via Self-Augmentation and Contrastive Learning

Pervaiz Iqbal Khan, Andreas Dengel, Sheraz Ahmed

PDF

Open Access

TL;DR

This paper introduces a novel self-augmentation and contrastive learning approach to improve disease detection from social media text, leading to significant performance gains over existing methods.

Contribution

The paper proposes a new method combining self-augmentation and contrastive learning to enhance language model representations for disease detection tasks.

Findings

01

Achieved up to 2.48% higher F1-score over baseline methods.

02

Demonstrated effectiveness across multiple NLP datasets.

03

Improved generalization in social media disease classification.

Abstract

Detecting diseases from social media has diverse applications, such as public health monitoring and disease spread detection. While language models (LMs) have shown promising performance in this domain, there remains ongoing research aimed at refining their discriminating representations. In this paper, we propose a novel method that integrates Contrastive Learning (CL) with language modeling to address this challenge. Our approach introduces a self-augmentation method, wherein hidden representations of the model are augmented with their own representations. This method comprises two branches: the first branch, a traditional LM, learns features specific to the given data, while the second branch incorporates augmented representations from the first branch to encourage generalization. CL further refines these representations by pulling pairs of original and augmented versions closer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Hate Speech and Cyberbullying Detection

MethodsContrastive Learning