RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs in Medicine

Jiatan Huang; Mingchen Li; Zonghai Yao; Dawei Li; Yuxin Zhang; Zhichao Yang; Yongkang Xiao; Feiyun Ouyang; Xiaohan Li; Shuo Han; Hong Yu

arXiv:2410.13987·cs.CL·April 14, 2026

RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs in Medicine

Jiatan Huang, Mingchen Li, Zonghai Yao, Dawei Li, Yuxin Zhang, Zhichao Yang, Yongkang Xiao, Feiyun Ouyang, Xiaohan Li, Shuo Han, Hong Yu

PDF

TL;DR

RiTeK is a new dataset designed to evaluate and improve large language models' ability to perform complex reasoning over medical textual knowledge graphs, addressing current limitations in medical data and retrieval methods.

Contribution

The paper introduces RiTeK, a comprehensive medical TKG dataset with synthesized queries and a benchmark for evaluating retrieval systems in medical reasoning tasks.

Findings

01

Existing retrieval methods perform poorly on RiTeK benchmark.

02

RiTeK covers diverse topological structures and complex textual descriptions.

03

Expert evaluation confirms the quality of synthesized queries.

Abstract

Answering complex real-world questions in the medical domain often requires accurate retrieval from medical Textual Knowledge Graphs (medical TKGs), as the relational path information from TKGs could enhance the inference ability of Large Language Models (LLMs). However, the main bottlenecks lie in the scarcity of existing medical TKGs, the limited expressiveness of their topological structures, and the lack of comprehensive evaluations of current retrievers for medical TKGs. To address these challenges, we first develop a Dataset1 for LLMs Complex Reasoning over medical Textual Knowledge Graphs (RiTeK), covering a broad range of topological structures. Specifically, we synthesize realistic user queries integrating diverse topological structures, relational information, and complex textual descriptions. We conduct a rigorous medical expert evaluation process to assess and validate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.