Capacity Bounds for Hyperbolic Neural Network Representations of Latent Tree Structures

Anastasis Kratsios; Ruiyang Hong; Haitz Sáez de Ocáriz Borde

arXiv:2308.09250·cs.LG·August 21, 2023

Open Access

TL;DR

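This paper proves that ReLU hyperbolic neural networks (HNNs) can embed any finite weighted tree into hyperbolic space of dimension at least 2 with distortion arbitrarily close to the optimum, using a network whose complexity does not depend on the distortion. By contrast, any ReLU MLP embedding a tree with $L > 2^{d}$ leaves into $d$-dimensional Euclidean space incurs distortion at least $Ω(L^{1/d})$.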

Contribution

It provides the first proof that HNNs can embed finite weighted trees with arbitrarily low distortion, gives upper bounds on the complexity of the embedding network, and shows that this complexity is independent of the representation fidelity.

Findings

01

HNNs can $ε$-isometrically embed any finite weighted tree into hyperbolic space of dimension at least 2, for any $ε > 1$ and any prescribed sectional curvature $κ < 0$.

02

Network complexity of HNNs is independent of embedding fidelity.

03

Any ReLU MLP embedding a tree with $L > 2^{d}$ leaves into $d$-dimensional Euclidean space incurs distortion at least $Ω(L^{1/d})$, independently of the network's depth.
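
To see why distortion in Euclidean space must grow with the number of leaves, the following is a standard volume-packing sketch for the special case of a unit-weight star tree (one hub $h$, leaves $v_1, \dots, v_L$). It is an illustration only, not the paper's proof; the map $f$, the constants $a$ and $b$, and the bi-Lipschitz notion of distortion $b/a$ are assumptions introduced here.

```latex
% Assume f maps the star into R^d with, for all nodes u, v,
%   a * d_T(u,v) <= ||f(u) - f(v)|| <= b * d_T(u,v),   distortion = b / a.
\begin{align*}
  \|f(v_i) - f(h)\|   &\le b
      && \text{(each leaf is at tree distance 1 from the hub)} \\
  \|f(v_i) - f(v_j)\| &\ge 2a \quad (i \ne j)
      && \text{(two leaves are at tree distance 2)} \\
  \bigsqcup_{i=1}^{L} B\bigl(f(v_i), a\bigr) &\subseteq B\bigl(f(h), a + b\bigr)
      && \text{(balls with disjoint interiors inside one larger ball)} \\
  L\, a^{d} &\le (a + b)^{d}
      && \text{(compare volumes in } \mathbb{R}^{d}\text{)} \\
  \frac{b}{a} &\ge L^{1/d} - 1 .
\end{align*}
```

The resulting bound exceeds 1 only once $L > 2^{d}$, and it grows like $L^{1/d}$ as the number of leaves increases.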

Abstract

We study the representation capacity of deep hyperbolic neural networks (HNNs) with a ReLU activation function. We establish the first proof that HNNs can $ε$-isometrically embed any finite weighted tree into a hyperbolic space of dimension $d$ at least equal to $2$ with prescribed sectional curvature $κ < 0$, for any $ε > 1$ (with $ε = 1$ being optimal). We establish rigorous upper bounds on the network complexity of an HNN implementing the embedding. We find that the network complexity of an HNN implementing the graph representation is independent of the representation fidelity/distortion. We contrast this result against our lower bounds on the distortion which any ReLU multi-layer perceptron (MLP) must incur when embedding a tree with $L > 2^{d}$ leaves into a $d$-dimensional Euclidean space, which we show is at least $Ω(L^{1/d})$; independently of the depth,…
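
The contrast between hyperbolic and Euclidean distortion can be illustrated numerically. The sketch below is not the paper's HNN construction: it places a unit-weight star tree by hand (hub at the origin, leaves evenly spaced on a circle) and measures a scale-invariant distortion, the largest ratio of embedded distance to tree distance divided by the smallest. That distortion notion, the layout, and all function names are assumptions made for illustration; the paper's definition of an $ε$-isometric embedding may differ. Pushing the leaves to a larger hyperbolic radius at fixed curvature $-1$ is equivalent to rescaling the tree's edge weights.

```python
import math
from itertools import combinations


def poincare_distance(x, y):
    """Geodesic distance in the Poincare disk model (curvature -1)."""
    diff_sq = (x[0] - y[0]) ** 2 + (x[1] - y[1]) ** 2
    denom = (1 - (x[0] ** 2 + x[1] ** 2)) * (1 - (y[0] ** 2 + y[1] ** 2))
    return math.acosh(1 + 2 * diff_sq / denom)


def euclidean_distance(x, y):
    """Ordinary distance in the plane."""
    return math.hypot(x[0] - y[0], x[1] - y[1])


def star_tree_distance(i, j):
    """Shortest-path distance in a unit-weight star; node 0 is the hub."""
    if i == j:
        return 0.0
    return 1.0 if 0 in (i, j) else 2.0


def star_layout(num_leaves, radius):
    """Hub at the origin, leaves evenly spaced on a circle of Euclidean radius `radius`."""
    points = [(0.0, 0.0)]
    for k in range(num_leaves):
        angle = 2 * math.pi * k / num_leaves
        points.append((radius * math.cos(angle), radius * math.sin(angle)))
    return points


def distortion(points, space_distance):
    """Scale-invariant distortion: (max expansion ratio) / (min expansion ratio)."""
    ratios = [
        space_distance(points[i], points[j]) / star_tree_distance(i, j)
        for i, j in combinations(range(len(points)), 2)
    ]
    return max(ratios) / min(ratios)


num_leaves = 16

# Euclidean plane: the result does not depend on the circle's radius.
euclid = distortion(star_layout(num_leaves, 1.0), euclidean_distance)

# Poincare disk: a point at hyperbolic radius r has Euclidean norm tanh(r / 2).
hyper = {
    r: distortion(star_layout(num_leaves, math.tanh(r / 2)), poincare_distance)
    for r in (1.0, 4.0, 8.0)
}

print(f"Euclidean distortion: {euclid:.3f}")
for r, value in hyper.items():
    print(f"Hyperbolic distortion at leaf radius {r}: {value:.3f}")
```

In this toy layout the Euclidean distortion is fixed by the number of leaves, while the hyperbolic distortion shrinks toward 1 as the leaves move farther from the hub, which matches the qualitative picture in the abstract.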

Taxonomy

Topics: Neural Networks and Applications · Topological and Geometric Data Analysis · Stochastic Gradient Optimization Techniques