Hyperbolic Deep Learning for Foundation Models: A Survey
Neil He, Hiren Madhu, Ngoc Bui, Menglin Yang, Rex Ying

TL;DR
This survey reviews how hyperbolic geometry can enhance foundation models by addressing limitations of Euclidean spaces, improving reasoning, generalization, and efficiency in large-scale AI systems.
Contribution
It provides a comprehensive overview of hyperbolic neural networks and their application to foundation models, highlighting recent advances and future research directions.
Findings
Hyperbolic spaces enable efficient hierarchical and power-law data embeddings.
Applying hyperbolic geometry improves reasoning and zero-shot generalization in foundation models.
Hyperbolic methods maintain parameter efficiency while enhancing model capabilities.
Abstract
Foundation models pre-trained on massive datasets, including large language models (LLMs), vision-language models (VLMs), and large multimodal models, have demonstrated remarkable success in diverse downstream tasks. However, recent studies have shown fundamental limitations of these models: (1) limited representational capacity, (2) lower adaptability, and (3) diminishing scalability. These shortcomings raise a critical question: is Euclidean geometry truly the optimal inductive bias for all foundation models, or could incorporating alternative geometric spaces enable models to better align with the intrinsic structure of real-world data and improve reasoning processes? Hyperbolic spaces, a class of non-Euclidean manifolds characterized by exponential volume growth with respect to distance, offer a mathematically grounded solution. These spaces enable low-distortion embeddings of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
