Relation Extraction Capabilities of LLMs on Clinical Text: A Bilingual Evaluation for English and Turkish
Aidana Aidynkyzy, O\u{g}uz Dikenelli, Oylum Alatl{\i}, \c{S}ebnem Bora

TL;DR
This study evaluates the relation extraction capabilities of large language models on clinical texts in English and Turkish, introducing a bilingual dataset and novel retrieval methods, showing prompting methods outperform fine-tuning.
Contribution
It presents the first bilingual clinical RE dataset and introduces Relation-Aware Retrieval, enhancing LLM performance in multilingual clinical NLP tasks.
Findings
Prompting-based LLMs outperform fine-tuned models.
English results are better than Turkish across all models.
RAR with structured reasoning achieves highest F1 scores.
Abstract
The scarcity of annotated datasets for clinical information extraction in non-English languages hinders the evaluation of large language model (LLM)-based methods developed primarily in English. In this study, we present the first comprehensive bilingual evaluation of LLMs for the clinical Relation Extraction (RE) task in both English and Turkish. To facilitate this evaluation, we introduce the first English-Turkish parallel clinical RE dataset, derived and carefully curated from the 2010 i2b2/VA relation classification corpus. We systematically assess a diverse set of prompting strategies, including multiple in-context learning (ICL) and Chain-of-Thought (CoT) approaches, and compare their performance to fine-tuned baselines such as PURE. Furthermore, we propose Relation-Aware Retrieval (RAR), a novel in-context example selection method based on contrastive learning, that is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Text Readability and Simplification
