Predicting Clinical Diagnosis from Patients Electronic Health Records Using BERT-based Neural Networks
Pavel Blinov, Manvel Avetisian, Vladimir Kokh, Dmitry Umerenkov,, Alexander Tuzhilin

TL;DR
This study introduces a novel BERT-based neural network model tailored for predicting clinical diagnoses from Russian Electronic Health Records, demonstrating superior performance over existing models and comparable accuracy to medical experts.
Contribution
We developed a modified BERT model with a new Fully-Connected layer approach, pretrained on domain-specific data, for improved clinical diagnosis prediction from EHRs.
Findings
Our model outperforms baseline text models in diagnosis classification.
The model achieves accuracy comparable to experienced medical professionals.
This approach is the largest study of its kind for Russian EHR data.
Abstract
In this paper we study the problem of predicting clinical diagnoses from textual Electronic Health Records (EHR) data. We show the importance of this problem in medical community and present comprehensive historical review of the problem and proposed methods. As the main scientific contributions we present a modification of Bidirectional Encoder Representations from Transformers (BERT) model for sequence classification that implements a novel way of Fully-Connected (FC) layer composition and a BERT model pretrained only on domain data. To empirically validate our model, we use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits. This is the largest such study for the Russian language and one of the largest globally. We performed a number of comparative experiments with other text representation models on the task of multiclass classification for 265…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Attention Dropout · Adam · Dense Connections · Linear Warmup With Linear Decay · Residual Connection · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Layer Normalization
