Investigating Alternative Feature Extraction Pipelines For Clinical Note Phenotyping
Neil Daniel

TL;DR
This paper compares a new feature extraction pipeline using ScispaCy and supervised models to traditional BERT-based methods for clinical note phenotyping, highlighting tradeoffs in accuracy, runtime, and attribute prediction.
Contribution
It introduces an alternative feature extraction pipeline for clinical note phenotyping that balances accuracy, runtime, and the ability to predict unseen attributes.
Findings
Alternative pipeline moderately underperforms LSTM in accuracy.
It offers faster runtime and the ability to predict new medical attributes.
The method can supplement existing phenotyping approaches.
Abstract
A common practice in the medical industry is the use of clinical notes, which consist of detailed patient observations. However, electronic health record systems frequently do not contain these observations in a structured format, rendering patient information challenging to assess and evaluate automatically. Using computational systems for the extraction of medical attributes offers many applications, including longitudinal analysis of patients, risk assessment, and hospital evaluation. Recent work has constructed successful methods for phenotyping: extracting medical attributes from clinical notes. BERT-based models can be used to transform clinical notes into a series of representations, which are then condensed into a single document representation based on their CLS embeddings and passed into an LSTM (Mulyar et al., 2020). Though this pipeline yields a considerable performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Semantic Web and Ontologies
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
