Cloud K-SVD: A Collaborative Dictionary Learning Algorithm for Big, Distributed Data
Haroon Raja, Waheed U. Bajwa

TL;DR
This paper introduces cloud K-SVD, a distributed algorithm for collaborative dictionary learning that enables multiple sites to learn a shared geometric data structure without sharing raw data, suitable for privacy-sensitive and large-scale applications.
Contribution
It proposes a novel distributed dictionary learning algorithm focused on union of subspaces, with theoretical analysis and practical validation on real and synthetic data.
Findings
Effective in learning shared dictionaries across distributed sites
Maintains data privacy by avoiding raw data exchange
Performs well on real and synthetic datasets
Abstract
This paper studies the problem of data-adaptive representations for big, distributed data. It is assumed that a number of geographically-distributed, interconnected sites have massive local data and they are interested in collaboratively learning a low-dimensional geometric structure underlying these data. In contrast to previous works on subspace-based data representations, this paper focuses on the geometric structure of a union of subspaces (UoS). In this regard, it proposes a distributed algorithm---termed cloud K-SVD---for collaborative learning of a UoS structure underlying distributed data of interest. The goal of cloud K-SVD is to learn a common overcomplete dictionary at each individual site such that every sample in the distributed data can be represented through a small number of atoms of the learned dictionary. Cloud K-SVD accomplishes this goal without requiring exchange of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
