Relation Extraction Capabilities of LLMs on Clinical Text: A Bilingual Evaluation for English and Turkish

Aidana Aidynkyzy; O\u{g}uz Dikenelli; Oylum Alatl{\i}; \c{S}ebnem Bora

arXiv:2601.09367·cs.CL·January 15, 2026

Relation Extraction Capabilities of LLMs on Clinical Text: A Bilingual Evaluation for English and Turkish

Aidana Aidynkyzy, O\u{g}uz Dikenelli, Oylum Alatl{\i}, \c{S}ebnem Bora

PDF

Open Access

TL;DR

This study evaluates the relation extraction capabilities of large language models on clinical texts in English and Turkish, introducing a bilingual dataset and novel retrieval methods, showing prompting methods outperform fine-tuning.

Contribution

It presents the first bilingual clinical RE dataset and introduces Relation-Aware Retrieval, enhancing LLM performance in multilingual clinical NLP tasks.

Findings

01

Prompting-based LLMs outperform fine-tuned models.

02

English results are better than Turkish across all models.

03

RAR with structured reasoning achieves highest F1 scores.

Abstract

The scarcity of annotated datasets for clinical information extraction in non-English languages hinders the evaluation of large language model (LLM)-based methods developed primarily in English. In this study, we present the first comprehensive bilingual evaluation of LLMs for the clinical Relation Extraction (RE) task in both English and Turkish. To facilitate this evaluation, we introduce the first English-Turkish parallel clinical RE dataset, derived and carefully curated from the 2010 i2b2/VA relation classification corpus. We systematically assess a diverse set of prompting strategies, including multiple in-context learning (ICL) and Chain-of-Thought (CoT) approaches, and compare their performance to fine-tuned baselines such as PURE. Furthermore, we propose Relation-Aware Retrieval (RAR), a novel in-context example selection method based on contrastive learning, that is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Text Readability and Simplification