Distributed Bayesian Matrix Decomposition for Big Data Mining and Clustering
Chihao Zhang, Yang Yang, Wei Zhang, Shihua Zhang

TL;DR
This paper introduces a distributed Bayesian matrix decomposition model designed for big data analytics, capable of handling heterogeneous noise, scaling efficiently, and addressing communication challenges in distributed systems.
Contribution
It proposes a novel distributed Bayesian matrix decomposition method with three implementation strategies and theoretical convergence analysis for big data mining and clustering.
Findings
Algorithms scale well to big data.
Achieves superior or competitive performance.
Effectively models heterogeneous noise.
Abstract
Matrix decomposition is one of the fundamental tools to discover knowledge from big data generated by modern applications. However, it is still inefficient or infeasible to process very big data using such a method in a single machine. Moreover, big data are often distributedly collected and stored on different machines. Thus, such data generally bear strong heterogeneous noise. It is essential and useful to develop distributed matrix decomposition for big data analytics. Such a method should scale up well, model the heterogeneous noise, and address the communication issue in a distributed system. To this end, we propose a distributed Bayesian matrix decomposition model (DBMD) for big data mining and clustering. Specifically, we adopt three strategies to implement the distributed computing including 1) the accelerated gradient descent, 2) the alternating direction method of multipliers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Blind Source Separation Techniques · Machine Learning and ELM
