Privacy-Preserving Data Publishing in Process Mining
Majid Rafiei, Wil M.P. van der Aalst

TL;DR
This paper addresses the challenge of balancing privacy preservation and data utility in process mining by proposing formal definitions for anonymization operations, a privacy metadata infrastructure, and extending data standards.
Contribution
It introduces formal definitions for anonymization in process mining, a privacy metadata infrastructure, and a privacy extension for the XES standard, enhancing privacy-aware data publishing.
Findings
Formalized anonymization operations for process mining
Designed a privacy metadata recording infrastructure
Extended XES standard for privacy-aware event data
Abstract
Process mining aims to provide insights into the actual processes based on event data. These data are often recorded by information systems and are widely available. However, they often contain sensitive private information that should be analyzed responsibly. Therefore, privacy issues in process mining are recently receiving more attention. Privacy preservation techniques obviously need to modify the original data, yet, at the same time, they are supposed to preserve the data utility. Privacy-preserving transformations of the data may lead to incorrect or misleading analysis results. Hence, new infrastructures need to be designed for publishing the privacy-aware event data whose aim is to provide metadata regarding the privacy-related transformations on event data without revealing details of privacy preservation techniques or the protected information. In this paper, we provide formal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
