Named Clinical Entity Recognition Benchmark
Wadood M Abdul,Marco AF Pimentel, Muhammad Umar Salman, Tathagata, Raha, Cl\'ement Christophe, Praveen K Kanithi, Nasir Hayat, Ronnie Rajan,, Shadab Khan

TL;DR
This paper presents a standardized benchmark for evaluating language models on clinical entity recognition, utilizing diverse datasets and the OMOP data model to promote transparency and innovation in healthcare NLP.
Contribution
It introduces a comprehensive benchmarking framework and leaderboard for clinical entity recognition, standardizing datasets and evaluation metrics across medical domains.
Findings
Models show varying performance across different clinical entities.
Standardization via OMOP improves interoperability and comparison.
Benchmarking reveals current limitations and areas for improvement.
Abstract
This technical report introduces a Named Clinical Entity Recognition Benchmark for evaluating language models in healthcare, addressing the crucial natural language processing (NLP) task of extracting structured information from clinical narratives to support applications like automated coding, clinical trial cohort identification, and clinical decision support. The leaderboard provides a standardized platform for assessing diverse language models, including encoder and decoder architectures, on their ability to identify and classify clinical entities across multiple medical domains. A curated collection of openly available clinical datasets is utilized, encompassing entities such as diseases, symptoms, medications, procedures, and laboratory measurements. Importantly, these entities are standardized according to the Observational Medical Outcomes Partnership (OMOP) Common Data Model,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Biomedical Text Mining and Ontologies
