Dictionary Learning for Massive Matrix Factorization
Arthur Mensch (PARIETAL), Julien Mairal (LEAR), Bertrand Thirion, (PARIETAL), Ga\"el Varoquaux (PARIETAL)

TL;DR
This paper introduces a scalable matrix factorization method for massive datasets, enabling efficient processing of terabyte-scale matrices in applications like fMRI analysis and recommender systems.
Contribution
A novel matrix factorization approach that scales to terabyte-sized datasets, outperforming existing methods in speed and efficiency.
Findings
Achieves significant speed-ups over state-of-the-art methods.
Successfully applied to large fMRI datasets for data analysis.
Effective in matrix completion tasks for recommender systems.
Abstract
Sparse matrix factorization is a popular tool to obtain interpretable data decompositions, which are also effective to perform data completion or denoising. Its applicability to large datasets has been addressed with online and randomized methods, that reduce the complexity in one of the matrix dimension, but not in both of them. In this paper, we tackle very large matrices in both dimensions. We propose a new factoriza-tion method that scales gracefully to terabyte-scale datasets, that could not be processed by previous algorithms in a reasonable amount of time. We demonstrate the efficiency of our approach on massive functional Magnetic Resonance Imaging (fMRI) data, and on matrix completion problems for recommender systems, where we obtain significant speed-ups compared to state-of-the art coordinate descent methods.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Tensor decomposition and applications · Face and Expression Recognition
