Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner

Wenchuan Zhang; Penghao Zhang; Jingru Guo; Tao Cheng; Jie Chen; Shuwan Zhang; Zhang Zhang; Yuhao Yi; Hong Bu

arXiv:2505.11404·cs.CV·March 24, 2026

Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner

Wenchuan Zhang, Penghao Zhang, Jingru Guo, Tao Cheng, Jie Chen, Shuwan Zhang, Zhang Zhang, Yuhao Yi, Hong Bu

PDF

Open Access 1 Repo 4 Models 1 Video

TL;DR

Patho-R1 is a multimodal reinforcement learning-based pathology reasoning system that leverages high-quality datasets and a three-stage training pipeline to improve diagnostic accuracy and reasoning in pathology tasks.

Contribution

The paper introduces Patho-R1, a novel multimodal RL-based pathology reasoner trained on high-quality datasets with a three-stage pipeline, enhancing reasoning and diagnostic performance.

Findings

01

Patho-R1 achieves robust performance on pathology tasks.

02

Patho-CLIP aligns well with pathology datasets.

03

Reinforcement learning improves reasoning quality.

Abstract

Recent advances in vision language models (VLMs) have enabled broad progress in the general medical field. However, pathology still remains a more challenging subdomain, with current pathology specific VLMs exhibiting limitations in both diagnostic accuracy and reasoning plausibility. Such shortcomings are largely attributable to the nature of current pathology datasets, which are primarily composed of image description pairs that lack the depth and structured diagnostic paradigms employed by real world pathologists. In this study, we leverage pathology textbooks and real world pathology experts to construct high-quality, reasoning-oriented datasets. Building on this, we introduce Patho-R1, a multimodal RL-based pathology Reasoner, trained through a three-stage pipeline: (1) continued pretraining on 3.5 million image-text pairs for knowledge infusion; (2) supervised fine-tuning on 500k…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wenchuan-zhang/patho-r1
paddleOfficial

Models

Videos

Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning

MethodsContrastive Language-Image Pre-training