ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs

Hongxin Ding; Baixiang Huang; Yue Fang; Weibin Liao; Xinke Jiang; Zheng Li; Junfeng Zhao; Yasha Wang

arXiv:2508.13514·cs.CL·August 20, 2025

ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs

Hongxin Ding, Baixiang Huang, Yue Fang, Weibin Liao, Xinke Jiang, Zheng Li, Junfeng Zhao, Yasha Wang

PDF

TL;DR

ProMed introduces a reinforcement learning framework with Shapley Information Gain rewards to enable medical LLMs to proactively ask questions, improving diagnostic accuracy and interaction quality in clinical settings.

Contribution

The paper presents ProMed, a novel RL-based approach that guides medical LLMs to ask clinically valuable questions using SIG, enhancing proactive medical questioning capabilities.

Findings

01

ProMed outperforms state-of-the-art methods by 6.29% on medical benchmarks.

02

ProMed achieves a 54.45% improvement over reactive models.

03

ProMed generalizes well to out-of-domain cases.

Abstract

Interactive medical questioning is essential in real-world clinical consultations, where physicians must actively gather information from patients. While medical Large Language Models (LLMs) have shown impressive capabilities in static medical question answering, they predominantly operate under a reactive paradigm: generating answers directly without seeking additional information, which risks incorrect diagnoses in such interactive settings. To address this limitation, we propose ProMed, a reinforcement learning (RL) framework that transitions medical LLMs toward a proactive paradigm, equipping them with the ability to ask clinically valuable questions before decision-making. At the core of ProMed is the Shapley Information Gain (SIG) reward, which quantifies the clinical utility of each question by combining the amount of newly acquired information with its contextual importance,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.