Factorbird - a Parameter Server Approach to Distributed Matrix Factorization
Sebastian Schelter, Venu Satuluri, Reza Zadeh

TL;DR
Factorbird is a scalable, flexible parameter server system designed for large-scale matrix factorization using SGD, capable of handling massive matrices exceeding memory limits and adaptable to various models and streaming data.
Contribution
Introduces Factorbird, a novel distributed matrix factorization system that scales to billions of non-zero entries and supports diverse models and streaming scenarios.
Findings
Successfully factorized a 38-billion non-zero matrix from Twitter data.
Demonstrated scalability and efficiency of the system on extremely large matrices.
Achieved state-of-the-art results in large-scale matrix factorization literature.
Abstract
We present Factorbird, a prototype of a parameter server approach for factorizing large matrices with Stochastic Gradient Descent-based algorithms. We designed Factorbird to meet the following desiderata: (a) scalability to tall and wide matrices with dozens of billions of non-zeros, (b) extensibility to different kinds of models and loss functions as long as they can be optimized using Stochastic Gradient Descent (SGD), and (c) adaptability to both batch and streaming scenarios. Factorbird uses a parameter server in order to scale to models that exceed the memory of an individual machine, and employs lock-free Hogwild!-style learning with a special partitioning scheme to drastically reduce conflicting updates. We also discuss other aspects of the design of our system such as how to efficiently grid search for hyperparameters at scale. We present experiments of Factorbird on a matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Graph Theory and Algorithms · Advanced Graph Neural Networks
