MedGPT: Medical Concept Prediction from Clinical Narratives
Zeljko Kraljevic, Anthony Shek, Daniel Bean, Rebecca Bendayan, James, Teo, Richard Dobson

TL;DR
MedGPT is a transformer-based model that predicts future medical events from clinical narratives by structuring free text in EHRs, demonstrating improved accuracy and capturing medical knowledge through attention analysis.
Contribution
Introduces MedGPT, a novel transformer pipeline utilizing NER and linking tools to predict future disorders from unstructured EHR text data.
Findings
Achieves higher prediction precision than LSTM models.
Effectively handles noise in free text EHR data.
Captures medical knowledge via attention mechanisms.
Abstract
The data available in Electronic Health Records (EHRs) provides the opportunity to transform care, and the best way to provide better care for one patient is through learning from the data available on all other patients. Temporal modelling of a patient's medical history, which takes into account the sequence of past events, can be used to predict future events such as a diagnosis of a new disorder or complication of a previous or existing disorder. While most prediction approaches use mostly the structured data in EHRs or a subset of single-domain predictions and outcomes, we present MedGPT a novel transformer-based pipeline that uses Named Entity Recognition and Linking tools (i.e. MedCAT) to structure and organize the free text portion of EHRs and anticipate a range of future medical events (initially disorders). Since a large portion of EHR data is in text form, such an approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Biomedical Text Mining and Ontologies
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
