Capacity Bounds for Hyperbolic Neural Network Representations of Latent Tree Structures
Anastasis Kratsios, Ruiyang Hong, Haitz S\'aez de Oc\'ariz Borde

TL;DR
This paper proves that hyperbolic neural networks can embed finite trees into hyperbolic space with minimal distortion, and compares their capacity and complexity to Euclidean embeddings, revealing fundamental differences.
Contribution
It provides the first proof of hyperbolic neural networks' ability to embed trees with low distortion and analyzes their network complexity independently of representation fidelity.
Findings
HNNs can embed any finite weighted tree into hyperbolic space with minimal distortion.
Network complexity of HNNs is independent of embedding fidelity.
Euclidean embeddings of trees require significantly higher distortion, especially with more leaves.
Abstract
We study the representation capacity of deep hyperbolic neural networks (HNNs) with a ReLU activation function. We establish the first proof that HNNs can -isometrically embed any finite weighted tree into a hyperbolic space of dimension at least equal to with prescribed sectional curvature , for any (where being optimal). We establish rigorous upper bounds for the network complexity on an HNN implementing the embedding. We find that the network complexity of HNN implementing the graph representation is independent of the representation fidelity/distortion. We contrast this result against our lower bounds on distortion which any ReLU multi-layer perceptron (MLP) must exert when embedding a tree with leaves into a -dimensional Euclidean space, which we show at least ; independently of the depth,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Topological and Geometric Data Analysis · Stochastic Gradient Optimization Techniques
