Parallel Nonnegative CP Decomposition of Dense Tensors
Grey Ballard, Koby Hayashi, Ramakrishnan Kannan

TL;DR
This paper introduces a scalable parallel algorithm for nonnegative CP tensor decomposition of dense data, optimizing computation and communication to handle large-scale tensors efficiently.
Contribution
It presents a novel distributed-memory parallel algorithm with optimized MTTKRP computation and communication strategies for nonnegative CP decomposition of dense tensors.
Findings
Scales efficiently to hundreds of nodes and thousands of cores.
Faster and more general than existing parallel software.
Effective on synthetic, hyperspectral, and neuroscience data.
Abstract
The CP tensor decomposition is a low-rank approximation of a tensor. We present a distributed-memory parallel algorithm and implementation of an alternating optimization method for computing a CP decomposition of dense tensor data that can enforce nonnegativity of the computed low-rank factors. The principal task is to parallelize the matricized-tensor times Khatri-Rao product (MTTKRP) bottleneck subcomputation. The algorithm is computation efficient, using dimension trees to avoid redundant computation across MTTKRPs within the alternating method. Our approach is also communication efficient, using a data distribution and parallel algorithm across a multidimensional processor grid that can be tuned to minimize communication. We benchmark our software on synthetic as well as hyperspectral image and neuroscience dynamic functional connectivity data, demonstrating that our algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
