Balanced Hyperbolic Embeddings Are Natural Out-of-Distribution Detectors
Tejaswi Kasarla, Max van Spengler, Pascal Mettes

TL;DR
This paper demonstrates that balanced hyperbolic embeddings serve as effective out-of-distribution detectors, outperforming existing methods across multiple datasets and scoring functions by leveraging hierarchical hyperbolic space.
Contribution
Introduces Balanced Hyperbolic Learning, a novel hyperbolic class embedding algorithm that optimizes hierarchical structure and balance for improved OOD detection.
Findings
Hyperbolic embeddings outperform existing OOD detection methods.
The approach beats other hyperbolic and contrastive methods.
Embeddings enable hierarchical OOD generalization.
Abstract
Out-of-distribution recognition forms an important and well-studied problem in deep learning, with the goal to filter out samples that do not belong to the distribution on which a network has been trained. The conclusion of this paper is simple: a good hierarchical hyperbolic embedding is preferred for discriminating in- and out-of-distribution samples. We introduce Balanced Hyperbolic Learning. We outline a hyperbolic class embedding algorithm that jointly optimizes for hierarchical distortion and balancing between shallow and wide subhierarchies. We then use the class embeddings as hyperbolic prototypes for classification on in-distribution data. We outline how to generalize existing out-of-distribution scoring functions to operate with hyperbolic prototypes. Empirical evaluations across 13 datasets and 13 scoring functions show that our hyperbolic embeddings outperform existing…
Peer Reviews
Decision·Submitted to ICLR 2025
## Strengths 1. The paper is generally well organized. 2. Connections between hyperbolic representation learning and out-of-distribution detection have been evidenced by prior work and new contributions at this intersection are interesting and valuable to the community. 3. The authors provide a good coverage of related works in out-of-distribution detection, hyperbolic embeddings of images, and hyperbolic learning of hierarchies, to place their proposed method in the right context. 4. Th
## Weaknesses 1. Some of the claims made by the authors are not substantiated by prior work or experimental evidence, for instance L077 “existing hyperbolic embedding methods are biased towards deeper and wider sub-trees, with smaller sub-trees pushed towards the origin” - how do the authors define the bias and how is this verified experimentally? 2. Key details of the setup and specific experiments are missing, making it difficult to accurately evaluate the comparison with prior methods and
The hierarchical hyperbolic embedding shows promise for OOD detection, which is an interesting idea. The effectiveness of the method is demonstrated through comparisons with various benchmarks and ablation studies. Overall, the paper is clear.
1) The motivation of “hierarchical” hyperbolic embedding for OOD detection is somewhat unclear. Could you please clarify the motivation behind using hierarchical relationships for hyperbolic embedding in OOD detection? Although it is well-known that hyperbolic embeddings can effectively represent distances in hierarchical graphs, it’s a little confusing about how this specifically benefits OOD detection. 2) It would be helpful to include some recent related works on hierarchical hyperbolic emb
**Structure and Clarity:** - The problem statement is very clear, hypothesis is well reasoned, and the justification well described. - Preliminaries are concise yet highly informative, and add little bloat to the work while providing the reader with the necessary understanding for both OOD and hyperbolic learning. - Results are presented clearly, with descriptive figures, graphs and accompanying written discussions. **Method hypothesis, findings, and rationale:** - The method is very well mot
**Learning a well defined hierarchy:** - One assumption you make is that the hyperbolic method does indeed learn a strong hierarchy. However, this work does not demonstrate empirically or theoretically that the hierarchy is in fact learnt. The distortion loss should help enforce this structure, however, empirical analysis would be a great addition. - Following from the prior, it is well known that in hyperbolic space embeddings can “collapse” to the boundary of the ball, and hence no hierarchy
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
