ALTO: Adaptive Linearized Storage of Sparse Tensors

Ahmed E. Helal; Jan Laukemann; Fabio Checconi; Jesmin Jahan Tithi,; Teresa Ranadive; Fabrizio Petrini; Jeewhan Choi

arXiv:2102.10245·cs.DC·April 28, 2021

ALTO: Adaptive Linearized Storage of Sparse Tensors

Ahmed E. Helal, Jan Laukemann, Fabio Checconi, Jesmin Jahan Tithi,, Teresa Ranadive, Fabrizio Petrini, Jeewhan Choi

PDF

1 Repo

TL;DR

ALTO introduces a versatile sparse tensor storage format that improves processing efficiency and memory usage, significantly outperforming existing formats in tensor decomposition tasks.

Contribution

The paper presents ALTO, a novel mode-agnostic sparse tensor format with adaptive encoding that enhances performance and reduces memory footprint.

Findings

01

ALTO achieves an 8X speedup over state-of-the-art mode-agnostic formats.

02

ALTO provides a 4.3X better compression ratio compared to mode-specific formats.

03

ALTO reduces workload imbalance and synchronization overhead in tensor computations.

Abstract

The analysis of high-dimensional sparse data is becoming increasingly popular in many important domains. However, real-world sparse tensors are challenging to process due to their irregular shapes and data distributions. We propose the Adaptive Linearized Tensor Order (ALTO) format, a novel mode-agnostic (general) representation that keeps neighboring nonzero elements in the multi-dimensional space close to each other in memory. To generate the indexing metadata, ALTO uses an adaptive bit encoding scheme that trades off index computations for lower memory usage and more effective use of memory bandwidth. Moreover, by decoupling its sparse representation from the irregular spatial distribution of nonzero elements, ALTO eliminates the workload imbalance and greatly reduces the synchronization overhead of tensor computations. As a result, the parallel performance of ALTO-based tensor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IntelLabs/ALTO
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.