VSF-Med:A Vulnerability Scoring Framework for Medical Vision-Language Models
Binesh Sadanandan, Vahid Behzadan

TL;DR
This paper presents VSF-Med, a comprehensive vulnerability scoring framework for medical vision-language models, combining attack templates, imperceptible perturbations, and a multi-dimensional rubric to evaluate security risks systematically.
Contribution
The paper introduces VSF-Med, the first end-to-end framework for systematically assessing vulnerabilities in medical VLMs using novel attack methods and a standardized scoring rubric.
Findings
State-of-the-art VLMs show significant vulnerability increases under attack.
VSF-Med enables reproducible benchmarking with over 30,000 adversarial variants.
Llama-3.2-11B-Vision-Instruct exhibits the highest vulnerability among tested models.
Abstract
Vision Language Models (VLMs) hold great promise for streamlining labour-intensive medical imaging workflows, yet systematic security evaluations in clinical settings remain scarce. We introduce VSF--Med, an end-to-end vulnerability-scoring framework for medical VLMs that unites three novel components: (i) a rich library of sophisticated text-prompt attack templates targeting emerging threat vectors; (ii) imperceptible visual perturbations calibrated by structural similarity (SSIM) thresholds to preserve clinical realism; and (iii) an eight-dimensional rubric evaluated by two independent judge LLMs, whose raw scores are consolidated via z-score normalization to yield a 0--32 composite risk metric. Built entirely on publicly available datasets and accompanied by open-source code, VSF--Med synthesizes over 30,000 adversarial variants from 5,000 radiology images and enables reproducible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education · Domain Adaptation and Few-Shot Learning
