Dynamic and Scalable Data Preparation for Object-Centric Process Mining
Lien Bosmans, Jari Peeperkorn, Alexandre Goossens, Giovanni Lugaresi,, Johannes De Smedt, Jochen De Weerdt

TL;DR
This paper introduces a scalable, flexible database format for object-centric process mining that supports continuous data ingestion, transformation, and analysis, addressing limitations of existing static event log formats.
Contribution
It proposes a novel relational schema and an end-to-end implementation for robust, scalable object-centric event log storage and processing.
Findings
Effective handling of streaming data with new object types and attributes
Improved data quality assessment and visualization capabilities
Validated through a lightweight, open-source data stack implementation
Abstract
Object-centric process mining is emerging as a promising paradigm across diverse industries, drawing substantial academic attention. To support its data requirements, existing object-centric data formats primarily facilitate the exchange of static event logs between data owners, researchers, and analysts, rather than serving as a robust foundational data model for continuous data ingestion and transformation pipelines for subsequent storage and analysis. This focus results into suboptimal design choices in terms of flexibility, scalability, and maintainability. For example, it is difficult for current object-centric event log formats to deal with novel object types or new attributes in case of streaming data. This paper proposes a database format designed for an intermediate data storage hub, which segregates process mining applications from their data sources using a hub-and-spoke…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis
