Yggdrasil: Privacy-aware Dual Deduplication in Multi Client Settings
Hadi Sehat, Elena Pagnin, Daniel E.Lucani

TL;DR
Yggdrasil is a privacy-aware deduplication protocol that reduces cloud storage by combining generalized deduplication, non-deterministic transformations, and client-side preprocessing, ensuring data privacy and high compression.
Contribution
The paper introduces Yggdrasil, a novel protocol that enhances data deduplication with privacy guarantees and improved compression in multi-client cloud storage settings.
Findings
Achieves 49% overall data compression
Clients store only 12% of data, cloud stores the rest
Provides high data uncertainty, with 10^296 possible original strings
Abstract
This paper proposes Yggdrasil, a protocol for privacy-aware dual data deduplication in multi client settings. Yggdrasil is designed to reduce the cloud storage space while safeguarding the privacy of the client's outsourced data. Yggdrasil combines three innovative tools to achieve this goal. First, generalized deduplication, an emerging technique to reduce data footprint. Second, non-deterministic transformations that are described compactly and improve the degree of data compression in the Cloud (across users). Third, data preprocessing in the clients in the form of lightweight, privacy-driven transformations prior to upload. This guarantees that an honest-but-curious Cloud service trying to retrieve the client's actual data will face a high degree of uncertainty as to what the original data is. We provide a mathematical analysis of the measure of uncertainty as well as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
