Multi-View Synergistic Learning with Vision-Language Adaption for Low-Resource Biomedical Image Classification

Xiaoliu Luo; Minxue Xiao; Ting Xie; Mengzhu Wang; Huiqing Qi; Joey Tianyi Zhou; Taiping Zhang; Xu Wang

arXiv:2604.23977·cs.CV·April 28, 2026

Multi-View Synergistic Learning with Vision-Language Adaption for Low-Resource Biomedical Image Classification

Xiaoliu Luo, Minxue Xiao, Ting Xie, Mengzhu Wang, Huiqing Qi, Joey Tianyi Zhou, Taiping Zhang, Xu Wang

PDF

TL;DR

This paper introduces MVSL, a framework that enhances low-resource biomedical image classification by decoupling visual and textual adaptation, employing multi-granularity contrastive learning, and leveraging language models for semantic regularization.

Contribution

MVSL is a novel unified approach that improves parameter-efficient adaptation and fine-grained discrimination in biomedical image classification under limited supervision.

Findings

01

MVSL outperforms state-of-the-art methods on 11 biomedical datasets.

02

It achieves superior few-shot and zero-shot classification accuracy.

03

The framework effectively models both global and local image semantics.

Abstract

Accurate biomedical image classification under low-resource conditions remains challenging due to limited annotations, subtle inter-class visual differences, and complex disease semantics. While vision--language models offer a promising foundation for mitigating data scarcity, their effective adaptation in biomedical settings is constrained by the need for parameter-efficient tuning alongside fine-grained and semantically consistent representation learning. In this work, we propose Multi-View Synergistic Learning (MVSL), a unified framework that addresses these challenges by jointly considering adaptation paradigms, representation granularity, and disease semantic relationships. MVSL decouples the adaptation of visual and textual encoders to respect their distinct representational characteristics, enabling more stable and effective parameter-efficient fine-tuning. It further introduces…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.