View Distillation with Unlabeled Data for Extracting Adverse Drug   Effects from User-Generated Data

Payam Karisani; Jinho D. Choi; Li Xiong

arXiv:2105.11354·cs.CL·May 25, 2021

View Distillation with Unlabeled Data for Extracting Adverse Drug Effects from User-Generated Data

Payam Karisani, Jinho D. Choi, Li Xiong

PDF

Open Access

TL;DR

This paper introduces a novel view distillation approach using unlabeled social media data and multi-layer transformers to improve the extraction of adverse drug reactions, significantly outperforming existing models.

Contribution

It proposes a new multi-view distillation method leveraging unlabeled data and transformer embeddings for ADR detection, enhancing performance over pretrained models.

Findings

01

Model outperforms domain-specific pretrained transformers on ADR data

02

Utilizes unlabeled social media data effectively for ADR extraction

03

Significant improvement in ADR detection accuracy

Abstract

We present an algorithm based on multi-layer transformers for identifying Adverse Drug Reactions (ADR) in social media data. Our model relies on the properties of the problem and the characteristics of contextual word embeddings to extract two views from documents. Then a classifier is trained on each view to label a set of unlabeled documents to be used as an initializer for a new classifier in the other view. Finally, the initialized classifier in each view is further trained using the initial training examples. We evaluated our model in the largest publicly available ADR dataset. The experiments testify that our model significantly outperforms the transformer-based models pretrained on domain-specific data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques