TL;DR
MedThink is a two-stage distillation framework that improves small language models' clinical reasoning and diagnostic accuracy by iterative teacher-guided training, outperforming other strategies on medical benchmarks.
Contribution
The paper introduces MedThink, a novel two-stage distillation method that enhances reasoning and accuracy in small models for clinical diagnosis tasks.
Findings
MedThink achieves up to 12.7% improvement over baseline in general tasks.
Reaches 56.4% top accuracy in gastroenterology dataset.
Outperforms six other distillation strategies across benchmarks.
Abstract
Accurate clinical diagnosis requires extensive domain knowledge and complex clinical reasoning capabilities. Although large language models (LLMs) hold great potential for clinical reasoning, their high computational and memory requirements limit their deployment in resource-constrained environments. Knowledge distillation (KD) can compress LLM capabilities into smaller models, but traditional KD merely transfers superficial answer patterns and fails to preserve the structured reasoning required for reliable diagnosis. To address this, we propose a two-stage distillation framework, MedThink, designed to cultivate robust clinical reasoning in small language models (SLMs). In the first stage, a teacher LLM screens data and injects domain-knowledge explanations to fine-tune a student model, establishing a knowledge foundation. In the second stage, the teacher evaluates the student's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
