Integrity Verification for Outsourcing Uncertain Frequent Itemset Mining
Qiwei Lu, Wenchao Huang, Yan Xiong, Xudong Gong

TL;DR
This paper addresses the challenge of verifying the correctness of uncertain frequent itemset mining results outsourced to third-party providers, extending existing verification methods to uncertain data scenarios with probabilistic guarantees.
Contribution
It extends deterministic outsourcing verification techniques to uncertain data, introduces improved schemes for different UFI definitions, and discusses probabilistic correctness guarantees.
Findings
Enhanced verification schemes for uncertain frequent itemsets.
Probabilistic guarantees for approximate UFI mining results.
Comparative analysis demonstrating scheme effectiveness.
Abstract
In recent years, due to the wide applications of uncertain data (e.g., noisy data), uncertain frequent itemsets (UFI) mining over uncertain databases has attracted much attention, which differs from the corresponding deterministic problem from the generalized definition and resolutions. As the most costly task in association rule mining process, it has been shown that outsourcing this task to a service provider (e.g.,the third cloud party) brings several benefits to the data owner such as cost relief and a less commitment to storage and computational resources. However, the correctness integrity of mining results can be corrupted if the service provider is with random fault or not honest (e.g., lazy, malicious, etc). Therefore, in this paper, we focus on the integrity and verification issue in UFI mining problem during outsourcing process, i.e., how the data owner verifies the mining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Imbalanced Data Classification Techniques
