PathReasoner-R1: Instilling Structured Reasoning into Pathology Vision-Language Model via Knowledge-Guided Policy Optimization
Songhan Jiang, Fengchun Liu, Ziyue Wang, Linghan Cai, Yongbing Zhang

TL;DR
PathReasoner-R1 introduces a knowledge-guided training approach for pathology vision-language models, enabling structured, transparent reasoning aligned with medical knowledge, thereby improving diagnostic robustness and clinical trust.
Contribution
The paper presents PathReasoner-R1, a novel model that integrates medical knowledge graphs into reasoning training, creating the first large-scale WSI reasoning dataset and a reinforcement learning framework for structured pathology reasoning.
Findings
Achieves state-of-the-art performance on pathology reasoning benchmarks.
Demonstrates improved logical consistency and robustness in diagnoses.
Provides a new dataset and training pipeline for pathology vision-language models.
Abstract
Vision-Language Models (VLMs) are advancing computational pathology with superior visual understanding capabilities. However, current systems often reduce diagnosis to directly output conclusions without verifiable evidence-linked reasoning, which severely limits clinical trust and hinders expert error rectification. To address these barriers, we construct PathReasoner, the first large-scale dataset of whole-slide image (WSI) reasoning. Unlike previous work reliant on unverified distillation, we develop a rigorous knowledge-guided generation pipeline. By leveraging medical knowledge graphs, we explicitly align structured pathological findings and clinical reasoning with diagnoses, generating over 20K high-quality instructional samples. Based on the database, we propose PathReasoner-R1, which synergizes trajectory-masked supervised fine-tuning with reasoning-oriented reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · AI in cancer detection · Digital Imaging for Blood Diseases
