Fast and Geometrically Grounded Lorentz Neural Networks
Robert van der Klis, Ricardo Ch\'avez Torres, Max van Spengler, Yuhui Ding, Thomas Hofmann, Pascal Mettes

TL;DR
This paper introduces a new Lorentz linear layer for hyperbolic neural networks that maintains hyperbolic properties during training, enabling efficient and geometrically grounded learning in hyperbolic space.
Contribution
The authors propose a novel Lorentz linear layer based on the distance-to-hyperplane formulation, improving the scaling of hyperbolic norms during training.
Findings
The new Lorentz linear layer ensures linear scaling of hyperbolic norms.
Algorithmic efficiencies enable hyperbolic networks to match Euclidean computation speeds.
The approach preserves hyperbolic geometry properties throughout training.
Abstract
Hyperbolic space is quickly gaining traction as a promising geometry for hierarchical and robust representation learning. A core open challenge is the development of a mathematical formulation of hyperbolic neural networks that is both efficient and captures the key properties of hyperbolic space. The Lorentz model of hyperbolic space has been shown to enable both fast forward and backward propagation. However, we prove that, with the current formulation of Lorentz linear layers, the hyperbolic norms of the outputs scale logarithmically with the number of gradient descent steps, nullifying the key advantage of hyperbolic geometry. We propose a new Lorentz linear layer grounded in the well-known ``distance-to-hyperplane" formulation. We prove that our formulation results in the usual linear scaling of output hyperbolic norms with respect to the number of gradient descent steps. Our new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Advanced Graph Neural Networks · Tensor decomposition and applications
