Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning
Yi Zhang, Chun-Wun Cheng, Junyi He, Ke Yu, Yushun Tang, Carola-Bibiane Sch\"onlieb, Zhihai He, Angelica I. Aviles-Rivero

TL;DR
This paper introduces Training-free Dual Hyperbolic Adapters (T-DHA), a novel method that embeds hierarchical semantic relationships in hyperbolic space to enhance cross-modal reasoning in vision-language models without additional training.
Contribution
The paper proposes a training-free hyperbolic adapter that models hierarchical semantic data in hyperbolic space, improving efficiency and performance in cross-modal reasoning tasks.
Findings
T-DHA outperforms state-of-the-art methods in few-shot image recognition.
T-DHA demonstrates superior domain generalization capabilities.
Hyperbolic embedding enhances hierarchical data representation.
Abstract
Recent research in Vision-Language Models (VLMs) has significantly advanced our capabilities in cross-modal reasoning. However, existing methods suffer from performance degradation with domain changes or require substantial computational resources for fine-tuning in new domains. To address this issue, we develop a new adaptation method for large vision-language models, called \textit{Training-free Dual Hyperbolic Adapters} (T-DHA). We characterize the vision-language relationship between semantic concepts, which typically has a hierarchical tree structure, in the hyperbolic space instead of the traditional Euclidean space. Hyperbolic spaces exhibit exponential volume growth with radius, unlike the polynomial growth in Euclidean space. We find that this unique property is particularly effective for embedding hierarchical data structures using the Poincar\'e ball model, achieving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
