Scalable and Sparsity-Aware Privacy-Preserving K-means Clustering with Application to Fraud Detection
Yingting Liu, Chaochao Chen, Jamie Cui, Li Wang, Lei Wang

TL;DR
This paper introduces a scalable, privacy-preserving K-means clustering framework optimized for sparse data. It aims to combine efficiency with provable security, and is demonstrated on a fraud detection application.
Contribution
The paper presents a novel sparsity-aware K-means framework with offline-online phases and vectorization, improving efficiency and privacy in large-scale, sparse data scenarios.
Findings
Achieves competitive performance in running time and communication size
Effective in large-scale, sparse datasets
Successfully applied to real-world fraud detection
Abstract
K-means is one of the most widely used clustering models in practice. Because of data isolation and the need for high model performance, how multiple parties can jointly build a practical and secure K-means model has become an important topic for many industrial applications. Existing work falls mainly into two types. The first type is efficient, but its information leakage raises potential privacy risks. The second type is provably secure but inefficient, and even impractical for large-scale sparse data. In this paper, we propose a new framework for efficient sparsity-aware K-means with three characteristics. First, our framework is divided into a data-independent offline phase and a much faster online phase, and the offline phase allows almost all cryptographic operations to be pre-computed. Second, we take advantage of the vectorization…
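The offline/online split described in the abstract can be illustrated with a generic secure-computation building block. The sketch below uses two-party additive secret sharing with Beaver multiplication triples, a standard technique chosen here as an assumption; the truncated abstract does not say which cryptographic primitives the paper actually uses. The modulus `P`, the trusted-dealer setup, and all function names are hypothetical and for illustration only.

```python
import secrets

# Toy prime modulus for the share arithmetic (an assumption, not from the paper).
P = 2**61 - 1

def share(x):
    """Split x into two additive shares modulo P."""
    r = secrets.randbelow(P)
    return r, (x - r) % P

def reconstruct(s0, s1):
    """Recombine two additive shares."""
    return (s0 + s1) % P

def offline_triple():
    """Offline phase: independent of any input data.

    A trusted dealer is assumed here for simplicity; it samples random
    a, b and hands out shares of a, b, and c = a * b.
    """
    a, b = secrets.randbelow(P), secrets.randbelow(P)
    return share(a), share(b), share((a * b) % P)

def online_multiply(x_sh, y_sh, triple):
    """Online phase: multiply shared x and y using one precomputed triple.

    Only two masked values are opened; the rest is local arithmetic.
    """
    (a0, a1), (b0, b1), (c0, c1) = triple
    # Open d = x - a and e = y - b; the random a and b mask x and y.
    d = reconstruct((x_sh[0] - a0) % P, (x_sh[1] - a1) % P)
    e = reconstruct((y_sh[0] - b0) % P, (y_sh[1] - b1) % P)
    # x*y = c + d*b + e*a + d*e; the public term d*e is added by party 0 only.
    z0 = (c0 + d * b0 + e * a0 + d * e) % P
    z1 = (c1 + d * b1 + e * a1) % P
    return z0, z1

# Usage: the triple is generated before the inputs are known.
triple = offline_triple()
x_sh, y_sh = share(7), share(6)
product = reconstruct(*online_multiply(x_sh, y_sh, triple))
```

In this toy setting the expensive, randomness-generating work sits in `offline_triple`, while `online_multiply` needs only additions, a few multiplications, and two openings. That mirrors the general idea of a data-independent offline phase followed by a faster online phase, though not necessarily the paper's concrete protocol.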
Taxonomy
Topics: Face and Expression Recognition · Stochastic Gradient Optimization Techniques · Random Matrices and Applications
