SIF: Semantically In-Distribution Fingerprints for Large Vision-Language Models

Yifei Zhao; Qian Lou; Mengxin Zheng

arXiv:2604.17041·cs.CV·April 21, 2026

SIF: Semantically In-Distribution Fingerprints for Large Vision-Language Models

Yifei Zhao, Qian Lou, Mengxin Zheng

PDF

1 Repo 1 Datasets

TL;DR

SIF introduces a novel, non-intrusive fingerprinting method for large vision-language models that ensures ownership verification is both stealthy and robust against adversarial attacks and model modifications.

Contribution

The paper proposes SIF, a semantic-preserving fingerprinting framework that transfers text watermarks into visual responses and enhances robustness through worst-case perturbation simulation.

Findings

01

SIF achieves high stealthiness and robustness in experiments.

02

It effectively resists semantic divergence attacks and model modifications.

03

Code is publicly available at the provided GitHub URL.

Abstract

The public accessibility of large vision-language models (LVLMs) raises serious concerns about unauthorized model reuse and intellectual property infringement. Existing ownership verification methods often rely on semantically abnormal queries or out-of-distribution responses as fingerprints, which can be easily detected and removed by adversaries. We expose this vulnerability through a Semantic Divergence Attack (SDA), which identifies and filters fingerprint queries by measuring semantic divergence between a suspect model and a reference model, showing that existing fingerprints are not semantic-preserving and are therefore easy to detect and bypass. To address these limitations, we propose SIF (Semantically In-Distribution Fingerprints), a non-intrusive ownership verification framework that requires no parameter modification. SIF introduces Semantic-Aligned Fingerprint Distillation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UCF-ML-Research/SIF-VLM-Fingerprint
github

Datasets

yifeiz29/VLM-Normal-Requests
dataset· 73 dl
73 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.