PathReasoner-R1: Instilling Structured Reasoning into Pathology Vision-Language Model via Knowledge-Guided Policy Optimization

Songhan Jiang; Fengchun Liu; Ziyue Wang; Linghan Cai; Yongbing Zhang

arXiv:2601.21617·cs.CV·January 30, 2026

PathReasoner-R1: Instilling Structured Reasoning into Pathology Vision-Language Model via Knowledge-Guided Policy Optimization

Songhan Jiang, Fengchun Liu, Ziyue Wang, Linghan Cai, Yongbing Zhang

PDF

Open Access

TL;DR

PathReasoner-R1 introduces a knowledge-guided training approach for pathology vision-language models, enabling structured, transparent reasoning aligned with medical knowledge, thereby improving diagnostic robustness and clinical trust.

Contribution

The paper presents PathReasoner-R1, a novel model that integrates medical knowledge graphs into reasoning training, creating the first large-scale WSI reasoning dataset and a reinforcement learning framework for structured pathology reasoning.

Findings

01

Achieves state-of-the-art performance on pathology reasoning benchmarks.

02

Demonstrates improved logical consistency and robustness in diagnoses.

03

Provides a new dataset and training pipeline for pathology vision-language models.

Abstract

Vision-Language Models (VLMs) are advancing computational pathology with superior visual understanding capabilities. However, current systems often reduce diagnosis to directly output conclusions without verifiable evidence-linked reasoning, which severely limits clinical trust and hinders expert error rectification. To address these barriers, we construct PathReasoner, the first large-scale dataset of whole-slide image (WSI) reasoning. Unlike previous work reliant on unverified distillation, we develop a rigorous knowledge-guided generation pipeline. By leveraging medical knowledge graphs, we explicitly align structured pathological findings and clinical reasoning with diagnoses, generating over 20K high-quality instructional samples. Based on the database, we propose PathReasoner-R1, which synergizes trajectory-masked supervised fine-tuning with reasoning-oriented reinforcement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · AI in cancer detection · Digital Imaging for Blood Diseases