$DC^2$: A Divide-and-conquer Algorithm for Large-scale Kernel Learning with Application to Clustering
Ke Alexander Wang, Xinran Bian, Pan Liu, Donghui Yan

TL;DR
The paper introduces the $DC^2$ algorithm, a divide-and-conquer approach for large-scale kernel learning that uses recursive random projections for data partitioning, enabling efficient, parallel clustering with minimal accuracy loss.
Contribution
The $DC^2$ algorithm offers a novel structure-preserving data division method that improves large-scale kernel learning efficiency and accuracy over traditional sampling-based approaches.
Findings
$DC^2$ achieves high clustering accuracy comparable to fast spectral clustering methods.
Recursive random projections reduce approximation errors compared to random sampling.
The algorithm enables parallel computation, significantly improving scalability.
Abstract
Divide-and-conquer is a general strategy to deal with large scale problems. It is typically applied to generate ensemble instances, which potentially limits the problem size it can handle. Additionally, the data are often divided by random sampling which may be suboptimal. To address these concerns, we propose the algorithm. Instead of ensemble instances, we produce structure-preserving signature pieces to be assembled and conquered. achieves the efficiency of sampling-based large scale kernel methods while enabling parallel multicore or clustered computation. The data partition and subsequent compression are unified by recursive random projections. Empirically dividing the data by random projections induces smaller mean squared approximation errors than conventional random sampling. The power of is demonstrated by our clustering algorithm , which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Clustering Algorithms Research · Sparse and Compressive Sensing Techniques
MethodsSpectral Clustering
