TL;DR
This survey comprehensively analyzes safety issues in large vision-language models, covering attacks, defenses, and evaluations, and introduces a unified framework and classification system to guide future research and improve model robustness.
Contribution
It provides a holistic framework for LVLM safety, introduces a lifecycle-based classification, and offers empirical safety evaluations and strategic recommendations for future improvements.
Findings
Identified key vulnerabilities in LVLMs
Evaluated safety of Deepseek Janus-Pro model
Outlined future directions for robustness enhancement
Abstract
With the rapid advancement of Large Vision-Language Models (LVLMs), ensuring their safety has emerged as a crucial area of research. This survey provides a comprehensive analysis of LVLM safety, covering key aspects such as attacks, defenses, and evaluation methods. We introduce a unified framework that integrates these interrelated components, offering a holistic perspective on the vulnerabilities of LVLMs and the corresponding mitigation strategies. Through an analysis of the LVLM lifecycle, we introduce a classification framework that distinguishes between inference and training phases, with further subcategories to provide deeper insights. Furthermore, we highlight limitations in existing research and outline future directions aimed at strengthening the robustness of LVLMs. As part of our research, we conduct a set of safety evaluations on the latest LVLM, Deepseek Janus-Pro, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
