An Experimental Comparison of Alternative Techniques for Event-Log Augmentation
Alessandro Padella, Francesco Vinci, Massimiliano de Leoni

TL;DR
This paper compares seven event-log augmentation techniques to determine which best enhances process mining, finding that stochastic transition system-based methods with resource modeling produce higher quality synthetic logs, outperforming traditional data augmentation.
Contribution
It provides the first comprehensive evaluation of state-of-the-art event-log augmentation techniques across multiple criteria and compares them with traditional data augmentation methods.
Findings
Stochastic transition system with resource modeling yields higher quality logs.
Event-log augmentation outperforms traditional data augmentation.
Augmentation techniques improve process mining effectiveness.
Abstract
Process mining analyzes and improves processes by examining transactional data stored in event logs, which record sequences of events with timestamps. However, the effectiveness of process mining, especially when combined with machine or deep learning, depends on having large event logs. Event log augmentation addresses this limitation by generating additional traces that simulate realistic process executions while considering various perspectives like time, control-flow, workflow, resources, and domain-specific attributes. Although prior research has explored event-log augmentation techniques, there has been no comprehensive comparison of their effectiveness. This paper reports on an evaluation of seven state-of-the-art augmentation techniques across eight event logs. The results are also compared with those obtained by a baseline technique based on a stochastic transition system. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis · Big Data and Business Intelligence · Software System Performance and Reliability
