The Impact of Event Data Partitioning on Privacy-aware Process Discovery
Jungeun Lim, Stephan A. Fahrenkrog-Petersen, Xixi Lu, Jan Mendling, Minseok Song

TL;DR
This paper proposes a method combining anonymization and event data partitioning using event abstraction to enhance privacy preservation in process discovery without significantly sacrificing utility.
Contribution
It introduces a novel pipeline that segments event logs via event abstraction, improving the utility of anonymized logs for process discovery.
Findings
Event partitioning improves process discovery utility.
Partitioning reduces utility loss in anonymization.
The approach is validated on real-world logs with positive results.
Abstract
Information systems support the execution of business processes. The event logs of these executions generally contain sensitive information about customers, patients, and employees. The corresponding privacy challenges can be addressed by anonymizing the event logs while still retaining utility for process discovery. However, trading off utility and privacy is difficult: the higher the complexity of event log, the higher the loss of utility by anonymization. In this work, we propose a pipeline that combines anonymization and event data partitioning, where event abstraction is utilized for partitioning. By leveraging event abstraction, event logs can be segmented into multiple parts, allowing each sub-log to be anonymized separately. This pipeline preserves privacy while mitigating the loss of utility. To validate our approach, we study the impact of event partitioning on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Business Process Modeling and Analysis · Access Control and Trust
