Scanflow: A multi-graph framework for Machine Learning workflow   management, supervision, and debugging

Gusseppe Bravo-Rocca; Peini Liu; Jordi Guitart; Ajay Dholakia; David; Ellison; Jeffrey Falkanger; Miroslav Hodak

arXiv:2111.03003·cs.LG·November 5, 2021

Scanflow: A multi-graph framework for Machine Learning workflow management, supervision, and debugging

Gusseppe Bravo-Rocca, Peini Liu, Jordi Guitart, Ajay Dholakia, David, Ellison, Jeffrey Falkanger, Miroslav Hodak

PDF

Open Access 1 Repo

TL;DR

Scanflow is a containerized multi-graph framework designed to manage, supervise, and debug machine learning workflows, enabling collaborative, modular, and flexible model monitoring and improvement in production environments.

Contribution

The paper introduces a novel containerized directed graph framework that supports end-to-end ML workflow management, supervision, and debugging with human and machine collaboration.

Findings

01

Effective detection of data drift in multiple datasets

02

Integration of human intervention improves model robustness

03

Promising accuracy results on benchmark datasets

Abstract

Machine Learning (ML) is more than just training models, the whole workflow must be considered. Once deployed, a ML model needs to be watched and constantly supervised and debugged to guarantee its validity and robustness in unexpected situations. Debugging in ML aims to identify (and address) the model weaknesses in not trivial contexts. Several techniques have been proposed to identify different types of model weaknesses, such as bias in classification, model decay, adversarial attacks, etc., yet there is not a generic framework that allows them to work in a collaborative, modular, portable, iterative way and, more importantly, flexible enough to allow both human- and machine-driven techniques. In this paper, we propose a novel containerized directed graph framework to support and accelerate end-to-end ML workflow management, supervision, and debugging. The framework allows defining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gusseppe/scanflow
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Data Stream Mining Techniques · Adversarial Robustness in Machine Learning