ControlMed: Adding Reasoning Control to Medical Language Model
Sung-Min Lee, Siyoon Lee, Juyeon Kim, Kyoungmin Roh

TL;DR
ControlMed is a medical language model that allows users to control the length of reasoning processes during inference, balancing accuracy and efficiency, and trained through a multi-stage process including reinforcement learning.
Contribution
It introduces a novel method for actively controlling reasoning length in medical LLMs, improving practicality and adaptability in clinical settings.
Findings
Achieves comparable or better performance than state-of-the-art models.
Enables flexible balancing of reasoning accuracy and computational efficiency.
Demonstrates effectiveness on English and Korean medical benchmarks.
Abstract
Reasoning Large Language Models (LLMs) with enhanced accuracy and explainability are increasingly being adopted in the medical domain, as the life-critical nature of clinical decision-making demands reliable support. Despite these advancements, existing reasoning LLMs often generate unnecessarily lengthy reasoning processes, leading to significant computational overhead and response latency. These limitations hinder their practical deployment in real-world clinical environments. To address these challenges, we introduce \textbf{ControlMed}, a medical language model that enables users to actively control the length of the reasoning process at inference time through fine-grained control markers. ControlMed is trained through a three-stage pipeline: 1) pre-training on a large-scale synthetic medical instruction dataset covering both \textit{direct} and \textit{reasoning responses}; 2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Multimodal Machine Learning Applications
