A Survey of Safety on Large Vision-Language Models: Attacks, Defenses   and Evaluations

Mang Ye; Xuankun Rong; Wenke Huang; Bo Du; Nenghai Yu; Dacheng Tao

arXiv:2502.14881·cs.CR·February 24, 2025

A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations

Mang Ye, Xuankun Rong, Wenke Huang, Bo Du, Nenghai Yu, Dacheng Tao

PDF

1 Repo

TL;DR

This survey comprehensively analyzes safety issues in large vision-language models, covering attacks, defenses, and evaluations, and introduces a unified framework and classification system to guide future research and improve model robustness.

Contribution

It provides a holistic framework for LVLM safety, introduces a lifecycle-based classification, and offers empirical safety evaluations and strategic recommendations for future improvements.

Findings

01

Identified key vulnerabilities in LVLMs

02

Evaluated safety of Deepseek Janus-Pro model

03

Outlined future directions for robustness enhancement

Abstract

With the rapid advancement of Large Vision-Language Models (LVLMs), ensuring their safety has emerged as a crucial area of research. This survey provides a comprehensive analysis of LVLM safety, covering key aspects such as attacks, defenses, and evaluation methods. We introduce a unified framework that integrates these interrelated components, offering a holistic perspective on the vulnerabilities of LVLMs and the corresponding mitigation strategies. Through an analysis of the LVLM lifecycle, we introduce a classification framework that distinguishes between inference and training phases, with further subcategories to provide deeper insights. Furthermore, we highlight limitations in existing research and outline future directions aimed at strengthening the robustness of LVLMs. As part of our research, we conduct a set of safety evaluations on the latest LVLM, Deepseek Janus-Pro, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

XuankunRong/Awesome-LVLM-Safety
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training