TL;DR
This paper introduces an accelerated ALS algorithm for collaborative filtering using nonlinear conjugate gradient, significantly reducing convergence time in distributed environments like Spark, and demonstrating scalability and efficiency on large datasets.
Contribution
The paper presents a novel ALS acceleration method with NCG wrapper and an efficient line search, implemented in Spark for large-scale collaborative filtering tasks.
Findings
ALS-NCG requires fewer iterations than ALS alone.
Achieves up to 4x speedup in distributed environments.
Scales linearly with problem size on large datasets.
Abstract
Collaborative filtering algorithms are important building blocks in many practical recommendation systems. For example, many large-scale data processing environments include collaborative filtering models for which the Alternating Least Squares (ALS) algorithm is used to compute latent factor matrix decompositions. In this paper, we propose an approach to accelerate the convergence of parallel ALS-based optimization methods for collaborative filtering using a nonlinear conjugate gradient (NCG) wrapper around the ALS iterations. We also provide a parallel implementation of the accelerated ALS-NCG algorithm in the Apache Spark distributed data processing environment, and an efficient line search technique as part of the ALS-NCG implementation that requires only one pass over the data on distributed datasets. In serial numerical experiments on a linux workstation and parallel numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
