Comparing Rule-Based and Deep Learning Models for Patient Phenotyping
Sebastian Gehrmann, Franck Dernoncourt, Yeran Li, Eric T. Carlson, Joy, T. Wu, Jonathan Welt, John Foote Jr., Edward T. Moseley, David W. Grant,, Patrick D. Tyler, Leo Anthony Celi

TL;DR
This study demonstrates that deep learning, specifically CNNs, significantly outperforms classical NLP methods in patient phenotyping tasks, offering improved accuracy and interpretability in analyzing clinical notes.
Contribution
The paper introduces a CNN-based approach for patient phenotyping that surpasses traditional methods in performance and enhances interpretability, reducing the need for extensive manual annotation.
Findings
CNNs outperform other models in all tasks
Average F1-score of 76 with CNNs
Model identifies salient phrases for predictions
Abstract
Objective: We investigate whether deep learning techniques for natural language processing (NLP) can be used efficiently for patient phenotyping. Patient phenotyping is a classification task for determining whether a patient has a medical condition, and is a crucial part of secondary analysis of healthcare data. We assess the performance of deep learning algorithms and compare them with classical NLP approaches. Materials and Methods: We compare convolutional neural networks (CNNs), n-gram models, and approaches based on cTAKES that extract pre-defined medical concepts from clinical notes and use them to predict patient phenotypes. The performance is tested on 10 different phenotyping tasks using 1,610 discharge summaries extracted from the MIMIC-III database. Results: CNNs outperform other phenotyping algorithms in all 10 tasks. The average F1-score of our model is 76 (PPV of 83,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Biomedical Text Mining and Ontologies · Topic Modeling
MethodsInterpretability
