PBSBench: A Multi-Level Vision-Language Framework and Benchmark for Hematopathology Whole Slide Image Interpretation
Yuanlong Wang, Weichi Chen, Adrian Rajab, Wenfang Liu, Yulan Jin, Andrew Srisuwananukorn, Ping Zhang

TL;DR
This paper introduces PBSBench, a comprehensive vision-language framework and benchmark tailored for interpreting hematopathology whole slide images, focusing on peripheral blood smears, with new datasets, models, and evaluation tasks.
Contribution
The authors create PBSInstr, a novel PBS-specific dataset, and develop PBS-VL, a specialized vision-language model, along with PBSBench, a benchmark for multi-level PBS interpretation.
Findings
PBS-VL outperforms existing models on PBSBench tasks.
The dataset includes 353 PBS WSIs, 29k cell crops, and 27k QA pairs.
The framework supports decision-making in hematopathology.
Abstract
Peripheral Blood Smear (PBS) is a critical microscopic examination in hematopathology that yields whole-slide imaging (WSI). Unlike solid tissue pathology, PBS interpretation focuses on individual cell morphologies rather than tissue architecture, making it distinct in both visual characteristics and diagnostic reasoning. However, current multimodal large language models (MLLMs) for pathology are primarily developed on solid-tissue WSIs and struggle to generalize to PBS. To bridge this gap, we construct PBSInstr, the first vision-language dataset for PBS interpretation, comprising 353 PBS WSIs paired with microscopic impression paragraphs and 29k cell-level image crops annotated with cell type labels and morphological descriptions. To facilitate instruction tuning, PBSInstr further includes 27k question-answer (QA) pairs for cell crops and 1,286 QA pairs for PBS slides. Building upon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
