Hyperbolic Deep Learning for Foundation Models: A Survey

Neil He; Hiren Madhu; Ngoc Bui; Menglin Yang; Rex Ying

arXiv:2507.17787·cs.LG·July 25, 2025

Hyperbolic Deep Learning for Foundation Models: A Survey

Neil He, Hiren Madhu, Ngoc Bui, Menglin Yang, Rex Ying

PDF

TL;DR

This survey reviews how hyperbolic geometry can enhance foundation models by addressing limitations of Euclidean spaces, improving reasoning, generalization, and efficiency in large-scale AI systems.

Contribution

It provides a comprehensive overview of hyperbolic neural networks and their application to foundation models, highlighting recent advances and future research directions.

Findings

01

Hyperbolic spaces enable efficient hierarchical and power-law data embeddings.

02

Applying hyperbolic geometry improves reasoning and zero-shot generalization in foundation models.

03

Hyperbolic methods maintain parameter efficiency while enhancing model capabilities.

Abstract

Foundation models pre-trained on massive datasets, including large language models (LLMs), vision-language models (VLMs), and large multimodal models, have demonstrated remarkable success in diverse downstream tasks. However, recent studies have shown fundamental limitations of these models: (1) limited representational capacity, (2) lower adaptability, and (3) diminishing scalability. These shortcomings raise a critical question: is Euclidean geometry truly the optimal inductive bias for all foundation models, or could incorporating alternative geometric spaces enable models to better align with the intrinsic structure of real-world data and improve reasoning processes? Hyperbolic spaces, a class of non-Euclidean manifolds characterized by exponential volume growth with respect to distance, offer a mathematically grounded solution. These spaces enable low-distortion embeddings of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.