Clipped Hyperbolic Classifiers Are Super-Hyperbolic Classifiers
Yunhui Guo, Xudong Wang, Yubei Chen, Stella X. Yu

TL;DR
This paper introduces clipped hyperbolic neural networks that overcome previous limitations by addressing vanishing gradients, resulting in classifiers that outperform traditional hyperbolic models and match Euclidean neural networks across various datasets.
Contribution
The paper proposes a simple clipping technique during training to enhance hyperbolic neural networks, making them competitive with Euclidean models on standard benchmarks.
Findings
Clipped HNNs outperform unclipped HNNs on hierarchical datasets.
Clipped HNNs match Euclidean neural networks on MNIST, CIFAR, and ImageNet.
Clipped HNNs exhibit improved adversarial robustness and out-of-distribution detection.
Abstract
Hyperbolic space can naturally embed hierarchies, unlike Euclidean space. Hyperbolic Neural Networks (HNNs) exploit such representational power by lifting Euclidean features into hyperbolic space for classification, outperforming Euclidean neural networks (ENNs) on datasets with known semantic hierarchies. However, HNNs underperform ENNs on standard benchmarks without clear hierarchies, greatly restricting HNNs' applicability in practice. Our key insight is that HNNs' poorer general classification performance results from vanishing gradients during backpropagation, caused by their hybrid architecture connecting Euclidean features to a hyperbolic classifier. We propose an effective solution by simply clipping the Euclidean feature magnitude while training HNNs. Our experiments demonstrate that clipped HNNs become super-hyperbolic classifiers: They are not only consistently better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques · Advanced Neural Network Applications
