Prink: $k_s$-Anonymization for Streaming Data in Apache Flink
Philip Groneberg, Saskia Nu\~nez von Voigt, Thomas Janke, Louis Loechel, Karl Wolf, Elias Gr\"unewald, Frank Pallas

TL;DR
Prink is a practical, semantics-aware ks-anonymization method for streaming data in Apache Flink, enabling privacy-preserving data processing with minimal performance impact in real-world applications.
Contribution
It introduces the first semantics-aware ks-anonymization for non-numerical streaming data with native Flink integration, improving privacy, data utility, and ease of deployment.
Findings
Allows inclusion of non-numerical data in anonymization
Provides discrete data points instead of aggregates
Demonstrates acceptable performance overheads in realistic settings
Abstract
In this paper, we present Prink, a novel and practically applicable concept and fully implemented prototype for ks-anonymizing data streams in real-world application architectures. Building upon the pre-existing, yet rudimentary CASTLE scheme, Prink for the first time introduces semantics-aware ks-anonymization of non-numerical (such as categorical or hierarchically generalizable) streaming data in a information loss-optimized manner. In addition, it provides native integration into Apache Flink, one of the prevailing frameworks for enterprise-grade stream data processing in numerous application domains. Our contributions excel the previously established state of the art for the privacy guarantee-providing anonymization of streaming data in that they 1) allow to include non-numerical data in the anonymization process, 2) provide discrete datapoints instead of aggregates, thereby…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Cryptography and Data Security
