Scanflow: A multi-graph framework for Machine Learning workflow management, supervision, and debugging
Gusseppe Bravo-Rocca, Peini Liu, Jordi Guitart, Ajay Dholakia, David, Ellison, Jeffrey Falkanger, Miroslav Hodak

TL;DR
Scanflow is a containerized multi-graph framework designed to manage, supervise, and debug machine learning workflows, enabling collaborative, modular, and flexible model monitoring and improvement in production environments.
Contribution
The paper introduces a novel containerized directed graph framework that supports end-to-end ML workflow management, supervision, and debugging with human and machine collaboration.
Findings
Effective detection of data drift in multiple datasets
Integration of human intervention improves model robustness
Promising accuracy results on benchmark datasets
Abstract
Machine Learning (ML) is more than just training models, the whole workflow must be considered. Once deployed, a ML model needs to be watched and constantly supervised and debugged to guarantee its validity and robustness in unexpected situations. Debugging in ML aims to identify (and address) the model weaknesses in not trivial contexts. Several techniques have been proposed to identify different types of model weaknesses, such as bias in classification, model decay, adversarial attacks, etc., yet there is not a generic framework that allows them to work in a collaborative, modular, portable, iterative way and, more importantly, flexible enough to allow both human- and machine-driven techniques. In this paper, we propose a novel containerized directed graph framework to support and accelerate end-to-end ML workflow management, supervision, and debugging. The framework allows defining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Data Stream Mining Techniques · Adversarial Robustness in Machine Learning
