Privacy-Preserving Data Publishing in Process Mining

Majid Rafiei; Wil M.P. van der Aalst

arXiv:2101.02627·cs.CR·January 8, 2021

Privacy-Preserving Data Publishing in Process Mining

Majid Rafiei, Wil M.P. van der Aalst

PDF

TL;DR

This paper addresses the challenge of balancing privacy preservation and data utility in process mining by proposing formal definitions for anonymization operations, a privacy metadata infrastructure, and extending data standards.

Contribution

It introduces formal definitions for anonymization in process mining, a privacy metadata infrastructure, and a privacy extension for the XES standard, enhancing privacy-aware data publishing.

Findings

01

Formalized anonymization operations for process mining

02

Designed a privacy metadata recording infrastructure

03

Extended XES standard for privacy-aware event data

Abstract

Process mining aims to provide insights into the actual processes based on event data. These data are often recorded by information systems and are widely available. However, they often contain sensitive private information that should be analyzed responsibly. Therefore, privacy issues in process mining are recently receiving more attention. Privacy preservation techniques obviously need to modify the original data, yet, at the same time, they are supposed to preserve the data utility. Privacy-preserving transformations of the data may lead to incorrect or misleading analysis results. Hence, new infrastructures need to be designed for publishing the privacy-aware event data whose aim is to provide metadata regarding the privacy-related transformations on event data without revealing details of privacy preservation techniques or the protected information. In this paper, we provide formal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.