Terminologies augmented recurrent neural network model for clinical named entity recognition
Ivan Lerner, Nicolas Paris, Xavier Tannier

TL;DR
This paper introduces a hybrid neural network model augmented with medical terminologies to improve clinical named-entity recognition in both English and French, achieving near state-of-the-art results.
Contribution
The study presents a novel hybrid system combining terminology-based features with biGRU-CRF for clinical NER, including a new French corpus for diverse clinical entities.
Findings
Hybrid system outperforms standalone models in F-measure.
Terminology augmentation improves performance in low-resource settings.
French corpus APcNER enables effective evaluation of clinical NER in French.
Abstract
We aimed to enhance the performance of a supervised model for clinical named-entity recognition (NER) using medical terminologies. In order to evaluate our system in French, we built a corpus for 5 types of clinical entities. We used a terminology-based system as baseline, built upon UMLS and SNOMED. Then, we evaluated a biGRU-CRF, and an hybrid system using the prediction of the terminology-based system as feature for the biGRU-CRF. In English, we evaluated the NER systems on the i2b2-2009 Medication Challenge for Drug name recognition, which contained 8,573 entities for 268 documents. In French, we built APcNER, a corpus of 147 documents annotated for 5 entities (drug name, sign or symptom, disease or disorder, diagnostic procedure or lab test and therapeutic procedure). We evaluated each NER systems using exact and partial match definition of F-measure for NER. The APcNER contains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
