Automated Batch Distillation Process Simulation for a Large Hybrid Dataset for Deep Anomaly Detection
Jennifer Werner, Justus Arweiler, Indra Jungjohann, Jochen Schmid, Fabian Jirasek, Hans Hasse, Michael Bortz

TL;DR
This paper presents an automated Python-based simulation workflow that creates a large hybrid dataset for deep anomaly detection in batch distillation, combining experimental and simulated data for improved process monitoring.
Contribution
It introduces a novel automated simulation process that generates a comprehensive hybrid dataset, facilitating deep anomaly detection research in chemical processes.
Findings
Automated simulation accurately predicts dynamics across experiments.
Hybrid dataset covers normal and anomalous operating conditions.
Open release of the large, annotated dataset supports future research.
Abstract
Anomaly detection (AD) in chemical processes based on deep learning offers significant opportunities but requires large, diverse, and well-annotated training datasets that are rarely available from industrial operations. In a recent work, we introduced a large, fully annotated experimental dataset for batch distillation under normal and anomalous operating conditions. In the present study, we augment this dataset with a corresponding simulation dataset, creating a novel hybrid dataset. The simulation data is generated in an automated workflow with a novel Python-based process simulator that employs a tailored index-reduction strategy for the underlying differential-algebraic equations. Leveraging the rich metadata and structured anomaly annotations of the experimental database, experimental records are automatically translated into simulation scenarios. After calibration to a single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
