Deep Neural Networks Ensemble for Detecting Medication Mentions in Tweets
Davy Weissenbacher, Abeed Sarker, Ari Klein, Karen O'Connor, Arjun, Magge Ranganatha, Graciela Gonzalez-Hernandez

TL;DR
This paper introduces Kusuri, an ensemble deep learning system that accurately detects medication mentions in Twitter posts, even with misspellings and ambiguity, achieving near-human performance on balanced datasets.
Contribution
Kusuri combines multiple classifiers and deep neural networks to improve medication mention detection in tweets, addressing challenges of misspellings and unbalanced data.
Findings
Achieved 93.7% F1-score on balanced dataset
Reached 76.3% F1-score on unbalanced dataset of real Twitter data
Outperformed existing systems on similar tasks
Abstract
Objective: After years of research, Twitter posts are now recognized as an important source of patient-generated data, providing unique insights into population health. A fundamental step to incorporating Twitter data in pharmacoepidemiological research is to automatically recognize medication mentions in tweets. Given that lexical searches for medication names may fail due to misspellings or ambiguity with common words, we propose a more advanced method to recognize them. Methods: We present Kusuri, an Ensemble Learning classifier, able to identify tweets mentioning drug products and dietary supplements. Kusuri ("medication" in Japanese) is composed of two modules. First, four different classifiers (lexicon-based, spelling-variant-based, pattern-based and one based on a weakly-trained neural network) are applied in parallel to discover tweets potentially containing medication names.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
