Beyond Graph Model: Reliable VLM Fine-Tuning via Random Graph Adapter
Bo Jiang, Xueyang Ze, Beibei Wang, Xixi Wang, Xixi Wan, and Bin Luo

TL;DR
This paper introduces VRGAdapter, a novel probabilistic graph-based adapter for Vision-Language Models that captures diverse textual descriptions and inter-class relationships, improving downstream task performance.
Contribution
The paper proposes a Vertex Random Graph Adapter (VRGAdapter) that models textual diversity and class relationships using a probabilistic graph, enhancing VLM fine-tuning.
Findings
VRGAdapter outperforms traditional adapters on benchmark datasets.
The probabilistic message propagation captures rich semantic information.
Uncertainty-guided fusion improves ensemble robustness.
Abstract
Textual adapter-based tuning methods have shown significant potential in transferring knowledge from pre-trained Vision-Language Models (VLMs) to downstream tasks. Existing works generally employ the deterministic textual feature adapter to refine each category textual representation. However, due to inherent factors such as different attributes and contexts, there exists significant diversity in textual descriptions for each category. Such description diversity offers rich discriminative semantic knowledge that can benefit downstream visual learning tasks. Obviously, traditional deterministic adapter model cannot adequately capture this varied semantic information. Also, it is desirable to exploit the inter-class relationships in VLM adapter. To address these issues, we propose to exploit random graph model into VLM adapter and develop a novel Vertex Random Graph Adapter (VRGAdapter).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Optical Network Technologies · Blind Source Separation Techniques
