An Empirical Evaluation of Flow Based Programming in the Machine Learning Deployment Context
Andrei Paleyes, Christian Cabrera, Neil D. Lawrence

TL;DR
This paper empirically evaluates Flow-Based Programming (FBP) for deploying machine learning in large systems, showing it simplifies data collection and discovery compared to Service Oriented Architecture (SOA).
Contribution
It demonstrates FBP as a practical paradigm for Data Oriented Architecture in ML deployment, filling a gap in implementation guidance.
Findings
FBP simplifies data collection and discovery.
FBP is suitable for data science tasks.
Compared to SOA, FBP improves code quality metrics.
Abstract
As use of data driven technologies spreads, software engineers are more often faced with the task of solving a business problem using data-driven methods such as machine learning (ML) algorithms. Deployment of ML within large software systems brings new challenges that are not addressed by standard engineering practices and as a result businesses observe high rate of ML deployment project failures. Data Oriented Architecture (DOA) is an emerging approach that can support data scientists and software developers when addressing such challenges. However, there is a lack of clarity about how DOA systems should be implemented in practice. This paper proposes to consider Flow-Based Programming (FBP) as a paradigm for creating DOA applications. We empirically evaluate FBP in the context of ML deployment on four applications that represent typical data science projects. We use Service Oriented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Software System Performance and Reliability · Scientific Computing and Data Management
Methodstravel james
