Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative Search
Haoran Sun, Yankai Jiang, Wenjie Lou, Yujie Zhang, Wenjie Li, Lilong Wang, Mianxin Liu, Lei Liu, Xiaosong Wang

TL;DR
This paper introduces MICS, a collaborative search scheme for generating high-quality medical reasoning data, leading to a new multimodal medical language model, Chiron-o1, with improved reasoning and visual question-answering abilities.
Contribution
It proposes MICS for effective medical reasoning data generation and develops Chiron-o1, a multimodal medical language model trained on this data, achieving state-of-the-art results.
Findings
Chiron-o1 outperforms existing models on medical VQA benchmarks.
MICS effectively generates high-quality reasoning paths for medical diagnosis.
Chiron-o1 demonstrates robust generalizable medical reasoning capabilities.
Abstract
Multimodal large language models (MLLMs) have begun to demonstrate robust reasoning capabilities on general tasks, yet their application in the medical domain remains in its early stages. Constructing chain-of-thought (CoT) training data is essential for bolstering the reasoning abilities of medical MLLMs. However, existing approaches exhibit a deficiency in offering a comprehensive framework for searching and evaluating effective reasoning paths towards critical diagnosis. To address this challenge, we propose Mentor-Intern Collaborative Search (MICS), a novel reasoning-path searching scheme to generate rigorous and effective medical CoT data. MICS first leverages mentor models to initialize the reasoning, one step at a time, then prompts each intern model to continue the thinking along those initiated paths, and finally selects the optimal reasoning path according to the overall…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Advanced Graph Neural Networks
