Advancing Anomaly Detection in Computational Workflows with Active Learning
Krishnan Raghavan, George Papadimitriou, Hongwei Jin, Anirban Mandal,, Mariam Kiran, Prasanna Balaprakash, Ewa Deelman

TL;DR
This paper introduces an active learning approach supported by the Poseidon-X framework to improve anomaly detection in computational workflows, reducing data requirements and enhancing detection accuracy in large-scale scientific computing environments.
Contribution
It presents a novel active learning method integrated with Poseidon-X for anomaly detection in workflows, evaluated on real and benchmark data, demonstrating resource savings and improved accuracy.
Findings
Active learning reduces training data needs.
Active learning improves anomaly detection accuracy.
Framework effectively detects anomalies in large-scale workflows.
Abstract
A computational workflow, also known as workflow, consists of tasks that are executed in a certain order to attain a specific computational campaign. Computational workflows are commonly employed in science domains, such as physics, chemistry, genomics, to complete large-scale experiments in distributed and heterogeneous computing environments. However, running computations at such a large scale makes the workflow applications prone to failures and performance degradation, which can slowdown, stall, and ultimately lead to workflow failure. Learning how these workflows behave under normal and anomalous conditions can help us identify the causes of degraded performance and subsequently trigger appropriate actions to resolve them. However, learning in such circumstances is a challenging task because of the large volume of high-quality historical data needed to train accurate and reliable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Anomaly Detection Techniques and Applications · Machine Learning and Algorithms
