Online Prediction of Dyadic Data with Heterogeneous Matrix Factorization
Guangyong Chen, Fengyuan Zhu, Pheng Ann Heng

TL;DR
This paper introduces a Bayesian nonparametric framework called HeMF for dyadic data prediction, combining mixed membership and latent factor models, with an online learning algorithm that outperforms existing methods on large-scale datasets.
Contribution
It develops a unified HeMF model that automatically determines community numbers and efficiently exploits latent structures, along with a novel online Variational Bayesian inference method.
Findings
HeMF outperforms state-of-the-art methods on benchmark datasets.
The online learning approach improves estimation accuracy and robustness.
The model effectively handles large-scale dyadic data prediction problems.
Abstract
Dyadic Data Prediction (DDP) is an important problem in many research areas. This paper develops a novel fully Bayesian nonparametric framework which integrates two popular and complementary approaches, discrete mixed membership modeling and continuous latent factor modeling into a unified Heterogeneous Matrix Factorization~(HeMF) model, which can predict the unobserved dyadics accurately. The HeMF can determine the number of communities automatically and exploit the latent linear structure for each bicluster efficiently. We propose a Variational Bayesian method to estimate the parameters and missing data. We further develop a novel online learning approach for Variational inference and use it for the online learning of HeMF, which can efficiently cope with the important large-scale DDP problem. We evaluate the performance of our method on the EachMoive, MovieLens and Netflix Prize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Statistical Methods and Inference
