cuFasterTucker: A Stochastic Optimization Strategy for Parallel Sparse FastTucker Decomposition on GPU Platform
Zixuan Li

TL;DR
This paper introduces cuFasterTucker, a GPU-based stochastic optimization method that significantly accelerates sparse high-order tensor decomposition, enabling efficient analysis of large-scale scientific data.
Contribution
It proposes a novel parallel FasterTucker algorithm on GPU that reduces storage and computational costs for high-dimensional sparse tensor decomposition.
Findings
Achieves approximately 15x speedup in updating factor matrices
Achieves approximately 7x speedup in updating core matrices
Effectively handles high-order, high-dimensional sparse tensors
Abstract
Currently, the size of scientific data is growing at an unprecedented rate. Data in the form of tensors exhibit high-order, high-dimensional, and highly sparse features. Although tensor-based analysis methods are very effective, the large increase in data size makes the original tensor impossible to process. Tensor decomposition decomposes a tensor into multiple low-rank matrices or tensors that can be exploited by tensor-based analysis methods. Tucker decomposition is such an algorithm, which decomposes a -order tensor into low-rank factor matrices and a low-rank core tensor. However, most Tucker decomposition methods are accompanied by huge intermediate variables and huge computational load, making them unable to process high-order and high-dimensional tensors. In this paper, we propose FasterTucker decomposition based on FastTucker decomposition, which is a variant of Tucker…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications
