NOMAD: Generating Embeddings for Massive Distributed Graphs

Aishwarya Sarkar; Sayan Ghosh; Nathan R. Tallent; Ali Jannesari

arXiv:2604.09419·cs.LG·April 13, 2026

NOMAD: Generating Embeddings for Massive Distributed Graphs

Aishwarya Sarkar, Sayan Ghosh, Nathan R. Tallent, Ali Jannesari

PDF

TL;DR

NOMAD is a distributed-memory framework for graph embeddings that scales efficiently to massive graphs, leveraging MPI and proximity-based models to improve speed and maintain quality.

Contribution

It introduces a scalable MPI-based framework for graph embeddings, optimizing trade-offs for large-scale distributed graphs in web and scientific domains.

Findings

01

Achieves median speedups of 10/100x on CPU clusters compared to reference implementations.

02

Demonstrates 35-76x speedup over distributed PBG.

03

Maintains competitive embedding quality with significant end-to-end speedups.

Abstract

Successful machine learning on graphs or networks requires embeddings that not only represent nodes and edges as low-dimensional vectors but also preserve the graph structure. Established methods for generating embeddings require flexible exploration of the entire graph through repeated use of random walks that capture graph structure with samples of nodes and edges. These methods create scalability challenges for massive graphs with millions-to-billions of edges because single-node solutions have inadequate memory and processing capabilities. We present NOMAD, a distributed-memory graph embedding framework using the Message Passing Interface (MPI) for distributed graphs. NOMAD implements proximity-based models proposed in the widely popular LINE (Large-scale Information Network Embedding) algorithm. We propose several practical trade-offs to improve the scalability and communication…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.