Cross-level Privacy Preserving Utility Mining

Jiahong Cai; Wensheng Gan; Philip S. Yu

arXiv:2605.00036 · cs.DB · May 4, 2026

TL;DR

This paper introduces algorithms for cross-level privacy-preserving utility mining that hide sensitive cross-level high-utility itemsets while preserving the utility of the sanitized database, especially on sparse datasets.

Contribution

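The paper proposes three CLPPUM algorithms (Min-RF, Max-RF, and Best-NSCF) that differ in how they select victim items, together with a dictionary structure, GI-dic, that speeds up the utility computations needed to identify victim items when protecting generalized items.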

Findings

01 All sensitive itemsets are successfully hidden without introducing artificial itemsets.

02 Min-RF and Best-NSCF outperform Max-RF on various datasets.

03 Min-RF performs best when the utility threshold is low and the datasets are dense.

Abstract

Privacy-preserving utility mining (PPUM) aims to hide sensitive high-utility patterns while preserving the utility of the sanitized database. In practice, however, many datasets are associated with taxonomic information, which makes the identification and processing of generalized items more challenging. To address this, we investigate the cross-level privacy-preserving utility mining (CLPPUM) problem and propose a method for protecting generalized items. Based on different victim item selection strategies, we develop three CLPPUM algorithms: minimum RGISU first (Min-RF), maximum RGISU first (Max-RF), and best NSC first (Best-NSCF). Furthermore, to enable efficient victim item identification, a novel dictionary structure named GI-dic is designed to accelerate the computation of required utility metrics. Experimental results on multiple datasets demonstrate that the proposed algorithms…
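
To make the setting concrete, here is a minimal Python sketch of the general idea: the utility of a generalized item is taken as the summed utility of its descendant leaf items, and a sensitive itemset is hidden by lowering quantities of a victim item until its utility falls below the threshold. This is an illustration under stated assumptions, not the paper's method. The abstract does not define RGISU or NSC, so the victim rule used here (pick the leaf occurrence with the largest utility contribution) is a placeholder and is not Min-RF, Max-RF, or Best-NSCF. All names (`leaves_of`, `tx_utility`, `hide`), the toy taxonomy, and the profit values are hypothetical, and positive integer quantities and profits are assumed.

```python
import math
from typing import Dict, Iterable, List, Set

Transaction = Dict[str, int]  # leaf item -> purchased quantity (assumed positive)


def leaves_of(item: str, taxonomy: Dict[str, List[str]]) -> Set[str]:
    """Leaf items covered by `item`; a leaf covers only itself."""
    children = taxonomy.get(item, [])
    if not children:
        return {item}
    result: Set[str] = set()
    for child in children:
        result |= leaves_of(child, taxonomy)
    return result


def tx_utility(itemset: Iterable[str], tx: Transaction,
               profit: Dict[str, int], taxonomy: Dict[str, List[str]]) -> int:
    """Utility of a (possibly generalized) itemset in one transaction, 0 if absent."""
    total = 0
    for item in itemset:
        present = [leaf for leaf in leaves_of(item, taxonomy) if tx.get(leaf, 0) > 0]
        if not present:
            return 0  # the transaction does not contain this item at any level
        total += sum(tx[leaf] * profit[leaf] for leaf in present)
    return total


def utility(itemset, db, profit, taxonomy) -> int:
    """Utility of the itemset summed over the whole database."""
    return sum(tx_utility(itemset, tx, profit, taxonomy) for tx in db)


def hide(itemset, db, profit, taxonomy, min_util: int) -> None:
    """Lower quantities in place until utility(itemset) < min_util.

    Placeholder victim rule (an assumption, not from the paper): choose the
    leaf occurrence contributing the most utility. Items are only reduced or
    deleted, so no artificial item is ever introduced.
    """
    while True:
        excess = utility(itemset, db, profit, taxonomy) - min_util + 1
        if excess <= 0:
            return
        candidates = [
            (tx, leaf)
            for tx in db
            if tx_utility(itemset, tx, profit, taxonomy) > 0
            for item in sorted(itemset)
            for leaf in sorted(leaves_of(item, taxonomy))
            if tx.get(leaf, 0) > 0
        ]
        if not candidates:
            return  # nothing left to reduce
        tx, leaf = max(candidates, key=lambda c: c[0][c[1]] * profit[c[1]])
        units = min(tx[leaf], math.ceil(excess / profit[leaf]))
        tx[leaf] -= units
        if tx[leaf] == 0:
            del tx[leaf]


# Hypothetical toy data: "Drink" generalizes the leaf items "cola" and "juice".
taxonomy = {"Drink": ["cola", "juice"]}
profit = {"cola": 2, "juice": 3, "bread": 1}
db = [{"cola": 2, "bread": 1}, {"juice": 1, "bread": 4}, {"bread": 5}]
original = [dict(tx) for tx in db]
sensitive = {"Drink", "bread"}
min_util = 10

before = utility(sensitive, db, profit, taxonomy)
hide(sensitive, db, profit, taxonomy, min_util)
after = utility(sensitive, db, profit, taxonomy)
print(before, after)
```

Because the sketch only lowers or deletes existing quantities, the sanitized database cannot contain an itemset that was absent from the original. A different victim rule would change how much non-sensitive utility is lost as a side effect, and this kind of trade-off is what the three strategies named in the abstract address.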
