# Graphyti: A Semi-External Memory Graph Library for FlashGraph

**Authors:** Disa Mhembere, Da Zheng, Carey E. Priebe, Joshua T. Vogelstein and, Randal Burns

arXiv: 1907.03335 · 2019-07-09

## TL;DR

Graphyti is a new semi-external memory graph library that enables efficient processing of large graphs on a single machine by minimizing I/O and memory use, achieving near in-memory performance.

## Contribution

It introduces principles for SEM application development and provides an extensible Python library built on FlashGraph for high-performance graph analytics.

## Key findings

- Achieves 80% of in-memory performance in SEM.
- Outperforms distributed engines like PowerGraph and Galois.
- Built on FlashGraph, available via pip.

## Abstract

Graph datasets exceed the in-memory capacity of most standalone machines. Traditionally, graph frameworks have overcome memory limitations through scale-out, distributing computing. Emerging frameworks avoid the network bottleneck of distributed data with Semi-External Memory (SEM) that uses a single multicore node and operates on graphs larger than memory. In SEM, $\mathcal{O}(m)$ data resides on disk and $\mathcal{O}(n)$ data in memory, for a graph with $n$ vertices and $m$ edges. For developers, this adds complexity because they must explicitly encode I/O within applications. We present principles that are critical for application developers to adopt in order to achieve state-of-the-art performance, while minimizing I/O and memory for algorithms in SEM. We present them in Graphyti, an extensible parallel SEM graph library built on FlashGraph and available in Python via pip. In SEM, Graphyti achieves 80% of the performance of in-memory execution and retains the performance of FlashGraph, which outperforms distributed engines, such as PowerGraph and Galois.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.03335/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/1907.03335/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/1907.03335/full.md

---
Source: https://tomesphere.com/paper/1907.03335