RJUA-QA: A Comprehensive QA Dataset for Urology
Shiwei Lyu, Chenfei Chi, Hongbo Cai, Lei Shi, Xiaoyan Yang, and Lei Liu, Xiang Chen, Deng Zhao, Zhiqiang Zhang, Xianguo Lyu, and Ming Zhang, Fangzhou Li, Xiaowei Ma, Yue Shen, Jinjie Gu and, Wei Xue, Yiran Huang

TL;DR
RJUA-QA is a new, comprehensive medical question-answering dataset focused on urology, designed to improve clinical reasoning and diagnostic capabilities of large language models with real patient scenarios.
Contribution
The paper introduces RJUA-QA, the first medical QA dataset for clinical reasoning in urology, covering extensive disease categories with expert-level knowledge and real-world clinical data.
Findings
Medical-specific LLMs show improved diagnostic accuracy on RJUA-QA.
General LLMs require fine-tuning to handle clinical reasoning tasks.
RJUA-QA dataset enhances the development of reliable medical AI applications.
Abstract
We introduce RJUA-QA, a novel medical dataset for question answering (QA) and reasoning with clinical evidence, contributing to bridge the gap between general large language models (LLMs) and medical-specific LLM applications. RJUA-QA is derived from realistic clinical scenarios and aims to facilitate LLMs in generating reliable diagnostic and advice. The dataset contains 2,132 curated Question-Context-Answer pairs, corresponding about 25,000 diagnostic records and clinical cases. The dataset covers 67 common urological disease categories, where the disease coverage exceeds 97.6\% of the population seeking medical services in urology. Each data instance in RJUA-QA comprises: (1) a question mirroring real patient to inquiry about clinical symptoms and medical conditions, (2) a context including comprehensive expert knowledge, serving as a reference for medical examination and diagnosis,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
