Why are hyperbolic neural networks effective? A study on hierarchical   representation capability

Shicheng Tan; Huanjing Zhao; Shu Zhao; Yanping Zhang

arXiv:2402.02478·cs.LG·February 6, 2024·1 cites

Why are hyperbolic neural networks effective? A study on hierarchical representation capability

Shicheng Tan, Huanjing Zhao, Shu Zhao, Yanping Zhang

PDF

Open Access 1 Repo

TL;DR

This paper investigates why hyperbolic neural networks are effective, revealing they do not achieve optimal hierarchical embedding, and proposes strategies to enhance their hierarchical representation capability and downstream performance.

Contribution

The study provides a benchmark for evaluating hierarchical representation capability in HNNs and introduces pre-training strategies to improve their effectiveness.

Findings

01

HNNs do not reach theoretical optimal embedding.

02

Hierarchical Representation Capability is influenced by objectives and structure.

03

Pre-training strategies significantly boost HNN performance.

Abstract

Hyperbolic Neural Networks (HNNs), operating in hyperbolic space, have been widely applied in recent years, motivated by the existence of an optimal embedding in hyperbolic space that can preserve data hierarchical relationships (termed Hierarchical Representation Capability, HRC) more accurately than Euclidean space. However, there is no evidence to suggest that HNNs can achieve this theoretical optimal embedding, leading to much research being built on flawed motivations. In this paper, we propose a benchmark for evaluating HRC and conduct a comprehensive analysis of why HNNs are effective through large-scale experiments. Inspired by the analysis results, we propose several pre-training strategies to enhance HRC and improve the performance of downstream tasks, further validating the reliability of the analysis. Experiments show that HNNs cannot achieve the theoretical optimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aitsc/hyperbolic-properties
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications