Towards Non-Euclidean Foundation Models: Advancing AI Beyond Euclidean Frameworks

Menglin Yang; Yifei Zhang; Jialin Chen; Melanie Weber; Rex Ying

arXiv:2505.14417·cs.CG·May 21, 2025

Towards Non-Euclidean Foundation Models: Advancing AI Beyond Euclidean Frameworks

Menglin Yang, Yifei Zhang, Jialin Chen, Melanie Weber, Rex Ying

PDF

TL;DR

This paper discusses the development of foundation models that utilize non-Euclidean geometries like hyperbolic and spherical spaces to better capture complex data structures, especially in web-related applications.

Contribution

It introduces the concept of Non-Euclidean Foundation Models and explores their potential to overcome limitations of Euclidean-based models in representing complex data.

Findings

01

Non-Euclidean spaces improve data representation efficiency.

02

Integration with foundation models enhances modeling of web data.

03

Potential for improved performance in search and recommendation systems.

Abstract

In the era of foundation models and Large Language Models (LLMs), Euclidean space is the de facto geometric setting of our machine learning architectures. However, recent literature has demonstrated that this choice comes with fundamental limitations. To that end, non-Euclidean learning is quickly gaining traction, particularly in web-related applications where complex relationships and structures are prevalent. Non-Euclidean spaces, such as hyperbolic, spherical, and mixed-curvature spaces, have been shown to provide more efficient and effective representations for data with intrinsic geometric properties, including web-related data like social network topology, query-document relationships, and user-item interactions. Integrating foundation models with non-Euclidean geometries has great potential to enhance their ability to capture and model the underlying structures, leading to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.