TL;DR
This paper introduces GENLOG, a neural network-based method for generating reliable process event streams and time series data, addressing data scarcity and variability issues in process monitoring applications.
Contribution
GENLOG is a novel approach that uses data resampling and neural networks to generate realistic event and time series data from limited or incomplete datasets.
Findings
GENLOG effectively boosts small datasets for process mining.
Generated data maintains the distribution and characteristics of original data.
The approach supports handling variations and concept drifts in data collection.
Abstract
Domains such as manufacturing and medicine crave for continuous monitoring and analysis of their processes, especially in combination with time series as produced by sensors. Time series data can be exploited to, for example, explain and predict concept drifts during runtime. Generally, a certain data volume is required in order to produce meaningful analysis results. However, reliable data sets are often missing, for example, if event streams and times series data are collected separately, in case of a new process, or if it is too expensive to obtain a sufficient data volume. Additional challenges arise with preparing time series data from multiple event sources, variations in data collection frequency, and concept drift. This paper proposes the GENLOG approach to generate reliable event and time series data that follows the distribution of the underlying input data set. GENLOG employs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
