Multilingual Clinical NER for Diseases and Medications Recognition in Cardiology Texts using BERT Embeddings
Manuela Daniela Danu, George Marica, Constantin Suciu, Lucian Mihai Itu, Oladimeji Farri

TL;DR
This paper develops multilingual BERT-based models to improve clinical named entity recognition for diseases and medications in cardiology texts across English, Spanish, and Italian, outperforming existing benchmarks.
Contribution
It introduces deep contextual embedding models tailored for multilingual clinical NER, addressing low-resource language challenges in cardiology texts.
Findings
Achieved high F1-scores across multiple languages and categories.
Outperformed mean and median leaderboard scores.
Demonstrated effectiveness of multilingual BERT models in clinical NER.
Abstract
The rapidly increasing volume of electronic health record (EHR) data underscores a pressing need to unlock biomedical knowledge from unstructured clinical texts to support advancements in data-driven clinical systems, including patient diagnosis, disease progression monitoring, treatment effects assessment, prediction of future clinical events, etc. While contextualized language models have demonstrated impressive performance improvements for named entity recognition (NER) systems in English corpora, there remains a scarcity of research focused on clinical texts in low-resource languages. To bridge this gap, our study aims to develop multiple deep contextual embedding models to enhance clinical NER in the cardiology domain, as part of the BioASQ MultiCardioNER shared task. We explore the effectiveness of different monolingual and multilingual BERT-based models, trained on general domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Biomedical Text Mining and Ontologies
