TL;DR
This paper explores the benefits of hyperbolic metric learning in computer vision, analyzing its theoretical foundations, comparing it with Euclidean space, and highlighting its connection to hard negative sampling for improved image embeddings.
Contribution
It provides a comprehensive benchmark of hyperbolic versus Euclidean spaces in vision transformers and offers a theoretical explanation for performance gains.
Findings
Hyperbolic space improves metric learning performance.
Hyperbolic metric learning is closely related to hard negative sampling.
Hybrid Euclidean-hyperbolic loss enhances vision transformer results.
Abstract
In recent years, there has been a growing trend of incorporating hyperbolic geometry methods into computer vision. While these methods have achieved state-of-the-art performance on various metric learning tasks using hyperbolic distance measurements, the underlying theoretical analysis supporting this superior performance remains under-exploited. In this study, we investigate the effects of integrating hyperbolic space into metric learning, particularly when training with contrastive loss. We identify a need for a comprehensive comparison between Euclidean and hyperbolic spaces regarding the temperature effect in the contrastive loss within the existing literature. To address this gap, we conduct an extensive investigation to benchmark the results of Vision Transformers (ViTs) using a hybrid objective function that combines loss from Euclidean and hyperbolic spaces. Additionally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Understanding Hyperbolic Metric Learning Through Hard Negative Sampling· youtube
