Synchronous and Asynchronous Parallelism Approaches for Generalized Canonical Polyadic Tensor Decomposition with GenTen
Jeremy M. Myers, Eric T. Phipps

TL;DR
This paper develops and evaluates synchronous and asynchronous parallel algorithms for generalized CP tensor decomposition, leveraging shared and distributed memory systems, to improve scalability and interpretability of high-dimensional data analysis.
Contribution
It introduces parallel GCP tensor decomposition algorithms using Kokkos and MPI, including an asynchronous approach inspired by federated learning, enhancing scalability and flexibility.
Findings
Parallel algorithms improve scalability on large datasets.
Asynchronous approach offers better efficiency with comparable accuracy.
Methods support various loss functions and data sparsity patterns.
Abstract
The Canonical Polyadic (CP) tensor decomposition is a well-known method for interpretable analysis of high-dimensional data. Recently, the Generalized CP method (GCP) was introduced by Hong and Kolda to allow for flexible choice of the loss function in the optimization problem defining the CP model, enabling more interpretable decompositions of strongly non-Gaussian data such as count or binary data. Furthermore, Kolda and Hong introduced a version of GCP that leverages randomization and stochastic optimization to address scalability to large, sparse data sets. In this work, we take these ideas a step further and consider synchronous and asynchronous algorithms for parallel GCP tensor decomposition through the GenTen software package, exploiting both shared and distributed memory parallelism. We build on shared memory parallel CP decomposition algorithms utilizing Kokkos for portability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
