Cross-level Privacy Preserving Utility Mining
Jiahong Cai, Wensheng Gan, Philip S. Yu

TL;DR
This paper introduces new algorithms for privacy-preserving utility mining that effectively hide sensitive cross-level high-utility itemsets while maintaining data utility, especially in sparse datasets.
Contribution
It proposes three novel CLPPUM algorithms with a new dictionary structure, improving efficiency and effectiveness in protecting generalized items in datasets.
Findings
All sensitive itemsets are successfully hidden without artificial itemsets.
Min-RF and Best-NSCF outperform Max-RF in various datasets.
Min-RF performs best when the utility threshold is low and datasets are dense.
Abstract
Privacy-preserving utility mining (PPUM) aims to hide sensitive high-utility patterns while preserving the utility of the sanitized database. In practice, however, many datasets are associated with taxonomic information, which makes the identification and processing of generalized items more challenging. To address this, we investigate the cross-level privacy-preserving utility mining (CLPPUM) problem and propose a method for protecting generalized items. Based on different victim item selection strategies, we develop three CLPPUM algorithms: minimum RGISU first (Min-RF), maximum RGISU first (Max-RF), and best NSC first (Best-NSCF). Furthermore, to enable efficient victim item identification, a novel dictionary structure named GI-dic is designed to accelerate the computation of required utility metrics. Experimental results on multiple datasets demonstrate that the proposed algorithms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
