Parallel Matrix Factorization for Binary Response
Rajiv Khanna, Liang Zhang, Deepak Agarwal, Beechung Chen

TL;DR
This paper introduces a parallel bilinear random effect model with adaptive rejection sampling for binary response data, significantly improving click rate prediction accuracy on large, imbalanced datasets.
Contribution
It develops a scalable parallel framework using Map-Reduce, novel adaptive rejection sampling, and ensemble methods to enhance binary response modeling in massive datasets.
Findings
Improved click rate prediction accuracy on benchmark datasets.
Scalable parallel algorithm for massive binary data.
Adaptive rejection sampling outperforms previous estimation techniques.
Abstract
Predicting user affinity to items is an important problem in applications like content optimization, computational advertising, and many more. While bilinear random effect models (matrix factorization) provide state-of-the-art performance when minimizing RMSE through a Gaussian response model on explicit ratings data, applying it to imbalanced binary response data presents additional challenges that we carefully study in this paper. Data in many applications usually consist of users' implicit response that are often binary -- clicking an item or not; the goal is to predict click rates, which is often combined with other measures to calculate utilities to rank items at runtime of the recommender systems. Because of the implicit nature, such data are usually much larger than explicit rating data and often have an imbalanced distribution with a small fraction of click events, making…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Consumer Market Behavior and Pricing
