Distributed-Memory Randomized Algorithms for Sparse Tensor CP Decomposition
Vivek Bharadwaj, Osman Asif Malik, Riley Murray, Aydin Bulu\c{c},, James Demmel

TL;DR
This paper introduces distributed-memory randomized algorithms for sparse tensor CP decomposition, achieving significant speedups and scalability for billion-scale data, with strong theoretical guarantees and optimized communication strategies.
Contribution
It presents the first distributed-memory implementations of randomized CP algorithms with leverage score sampling, improving speed and scalability for large sparse tensors.
Findings
11x speedup over SPLATT on billion-scale Reddit tensor
Decomposition of billion-scale tensor in under two minutes on 512 cores
Algorithms offer strong theoretical guarantees with optimized communication
Abstract
Candecomp / PARAFAC (CP) decomposition, a generalization of the matrix singular value decomposition to higher-dimensional tensors, is a popular tool for analyzing multidimensional sparse data. On tensors with billions of nonzero entries, computing a CP decomposition is a computationally intensive task. We propose the first distributed-memory implementations of two randomized CP decomposition algorithms, CP-ARLS-LEV and STS-CP, that offer nearly an order-of-magnitude speedup at high decomposition ranks over well-tuned non-randomized decomposition packages. Both algorithms rely on leverage score sampling and enjoy strong theoretical guarantees, each with varying time and accuracy tradeoffs. We tailor the communication schedule for our random sampling algorithms, eliminating expensive reduction collectives and forcing communication costs to scale with the random sample count. Finally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Tensor decomposition and applications · Advanced Neural Network Applications
