Automatic Extraction of Medication Names in Tweets as Named Entity Recognition

Carol Anderson; Bo Liu; Anas Abidin; Hoo-Chang Shin; Virginia Adams

arXiv:2111.15641·cs.CL·December 1, 2021

TL;DR

This paper recognizes medication mentions in tweets using ensembles of fine-tuned BERT-style models; the best system reaches a strict F1 score of 0.764 on unseen test data.

Contribution

It applies ensembles of fine-tuned BERT-style models, including Megatron-BERT-345M, to medication named entity recognition in tweets for BioCreative VII Task 3.

Findings

01

Achieved a strict F1 score of 0.764 on test data.

02

Ensemble of five Megatron-BERT models outperforms individual models.

03

Demonstrated effectiveness of BERT-based models for social media medical text analysis.
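
For readers unfamiliar with the metric in the first finding, the following is a minimal sketch of a strict F1 computation. It assumes that "strict" means a predicted span counts as correct only when its tweet ID, start offset, and end offset all match a gold span exactly, which is the usual convention in NER evaluation; the official task scorer may differ in details. The function name `strict_f1` and the span tuples are hypothetical and not taken from the paper.

```python
def strict_f1(gold_spans, predicted_spans):
    """Compute F1 where a prediction is correct only on an exact span match.

    Each span is a hypothetical (tweet_id, start, end) tuple.
    """
    gold = set(gold_spans)
    predicted = set(predicted_spans)
    true_positives = len(gold & predicted)

    # Guard against empty inputs to avoid division by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative data only: the second prediction ends one character early,
# so under strict matching it counts as both a miss and a false positive.
gold = [("t1", 0, 7), ("t2", 5, 12)]
predicted = [("t1", 0, 7), ("t2", 5, 11)]
score = strict_f1(gold, predicted)
```

In this toy case one of two predictions matches exactly, so precision and recall are both 0.5.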

Abstract

Social media posts contain potentially valuable information about medical conditions and health-related behavior. BioCreative VII Task 3 focuses on mining this information by recognizing mentions of medications and dietary supplements in tweets. We approach this task by fine-tuning multiple BERT-style language models to perform token-level classification, and combining them into ensembles to generate final predictions. Our best system consists of five Megatron-BERT-345M models and achieves a strict F1 score of 0.764 on unseen test data.
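
The abstract does not say how the individual models' token-level predictions are combined. One common option, shown here purely as an illustrative sketch, is a per-token majority vote across models. The function name `majority_vote`, the B/I/O label scheme, and the example predictions are all assumptions for illustration, not details from the paper.

```python
from collections import Counter


def majority_vote(model_predictions):
    """Combine per-token labels from several models by majority vote.

    model_predictions: one label sequence per model, all of equal length.
    Ties go to the label that appears first in model order.
    """
    if not model_predictions:
        raise ValueError("need predictions from at least one model")
    n_tokens = len(model_predictions[0])
    if any(len(labels) != n_tokens for labels in model_predictions):
        raise ValueError("all models must label the same number of tokens")

    ensembled = []
    # zip(*...) regroups the labels so each tuple holds one token's votes.
    for votes in zip(*model_predictions):
        label, _count = Counter(votes).most_common(1)[0]
        ensembled.append(label)
    return ensembled


# Hypothetical output of five models for the tokens
# ["took", "some", "tylenol", "today"], using an assumed B/I/O scheme.
predictions = [
    ["O", "O", "B", "O"],
    ["O", "O", "B", "O"],
    ["O", "B", "B", "O"],
    ["O", "O", "O", "O"],
    ["O", "O", "B", "O"],
]
final_labels = majority_vote(predictions)
```

Averaging the models' class probabilities before taking the argmax is another common choice; voting is used here only because it needs nothing beyond the predicted labels.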

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques