Phenotyping of Clinical Notes with Improved Document Classification Models Using Contextualized Neural Language Models
Andriy Mulyar, Elliot Schumacher, Masoud Rouhizadeh, Mark Dredze

TL;DR
This paper explores the use of BERT-based neural language models to improve the classification of clinical notes for patient phenotyping, eliminating manual feature engineering and achieving competitive or superior results.
Contribution
It introduces BERT-based architectures for phenotyping from clinical notes, demonstrating their effectiveness without manual rule-based engineering.
Findings
BERT models outperform traditional methods on phenotyping tasks.
Contextualized neural models achieve state-of-the-art performance.
Elimination of manual feature engineering simplifies the phenotyping process.
Abstract
Clinical notes contain an extensive record of a patient's health status, such as smoking status or the presence of heart conditions. However, this detail is not replicated within the structured data of electronic health systems. Phenotyping, the extraction of patient conditions from free clinical text, is a critical task which supports avariety of downstream applications such as decision support and secondary use of medical records. Previous work has resulted in systems which are high performing but require hand engineering, often of rules. Recent work in pretrained contextualized language models have enabled advances in representing text for a variety of tasks. We therefore explore several architectures for modeling pheno-typing that rely solely on BERT representations of the clinical note, removing the need for manual engineering. We find these architectures are competitive with or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
