Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment

Lubin Gan; Jing Zhang; Linhao Qu; Yijun Wang; Siying Wu; Xiaoyan Sun

arXiv:2508.01602 · cs.CV · August 7, 2025

TL;DR

This paper introduces FG-PAN, a zero-shot framework for brain tumor subtype classification from histopathological whole slide images. It aligns spatially refined patch-level visual features with pathology-aware text descriptions generated by a large language model, and the authors report state-of-the-art zero-shot results.

Contribution

The paper presents the Fine-Grained Patch Alignment Network (FG-PAN), which enhances zero-shot classification by combining spatially refined patch-level visual features with class-specific semantic prototypes generated by a large language model.

Findings

01. Achieves state-of-the-art zero-shot classification accuracy on pathology datasets.

02. Effectively captures subtle morphological differences in tumor subtypes.

03. Demonstrates robust generalization across multiple datasets.

Abstract

The fine-grained classification of brain tumor subtypes from histopathological whole slide images is highly challenging due to subtle morphological variations and the scarcity of annotated data. Although vision-language models have enabled promising zero-shot classification, their ability to capture fine-grained pathological features remains limited, resulting in suboptimal subtype discrimination. To address these challenges, we propose the Fine-Grained Patch Alignment Network (FG-PAN), a novel zero-shot framework tailored for digital pathology. FG-PAN consists of two key modules: (1) a local feature refinement module that enhances patch-level visual features by modeling spatial relationships among representative patches, and (2) a fine-grained text description generation module that leverages large language models to produce pathology-aware, class-specific semantic prototypes. By…
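
The two modules described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the neighbor-averaging refinement, the mean pooling over patches and descriptions, the temperature value, and all function and parameter names below are assumptions chosen for illustration. It presumes patch features and text description embeddings have already been produced by some vision-language model in a shared embedding space.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-8):
    """Scale vectors to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def refine_patches(feats, coords, k=4, alpha=0.5):
    """Hypothetical local refinement: blend each patch feature with the mean
    of its k nearest spatial neighbours (a stand-in for a learned module)."""
    k = min(k, len(feats) - 1)
    if k <= 0:
        return feats.copy()
    # Pairwise Euclidean distances between patch coordinates, shape (N, N).
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    np.fill_diagonal(dist, np.inf)  # a patch is not its own neighbour
    nearest = np.argsort(dist, axis=1)[:, :k]       # (N, k) neighbour indices
    neighbour_mean = feats[nearest].mean(axis=1)    # (N, D)
    return (1.0 - alpha) * feats + alpha * neighbour_mean

def zero_shot_classify(feats, coords, text_protos, temperature=0.07):
    """Score a slide against class-specific text prototypes.

    feats:       (N, D) patch embeddings
    coords:      (N, 2) patch positions on the slide
    text_protos: (C, M, D) embeddings of M text descriptions per class
    Returns the predicted class index and the class probabilities.
    """
    v = l2_normalize(refine_patches(feats, coords))
    t = l2_normalize(text_protos)
    # Cosine similarity of every patch to every description: (N, C, M).
    sim = np.einsum("nd,cmd->ncm", v, t)
    # Assumed pooling: average over descriptions, then over patches.
    class_scores = sim.mean(axis=2).mean(axis=0)    # (C,)
    logits = class_scores / temperature
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return int(np.argmax(probs)), probs
```

In this sketch, each class is represented by several description embeddings rather than a single class-name prompt, which is one simple way to realize "class-specific semantic prototypes"; how FG-PAN actually aggregates patch and text similarities is not stated in the portion of the abstract shown here.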
