Machine Learning Systems: A Survey from a Data-Oriented Perspective
Christian Cabrera, Andrei Paleyes, Pierre Thodoroff, Neil D. Lawrence

TL;DR
This survey examines how data-oriented architecture (DOA) is implicitly adopted in deploying ML systems, highlighting design decisions, practices, and benefits for handling big data, latency, and security challenges.
Contribution
It explicitly analyzes the implicit adoption of DOA in ML system deployment, providing insights and practical advice to improve implementation practices.
Findings
Most ML systems partially adopt DOA principles.
Adoption of DOA improves handling of big data and low latency requirements.
Practical guidelines are proposed for deploying ML-based systems.
Abstract
Engineers are deploying ML models as parts of real-world systems with the upsurge of AI technologies. Real-world environments challenge the deployment of such systems because these environments produce large amounts of heterogeneous data, and users require increasingly efficient responses. These requirements push prevalent software architectures to the limit when deploying ML-based systems. Data-oriented Architecture (DOA) is an emerging style that equips systems better for integrating ML models. Even though papers on deployed ML systems do not mention DOA, their authors made design decisions that implicitly follow DOA. Implicit decisions create a knowledge gap, limiting the practitioners' ability to implement ML-based systems. \hlb{This paper surveys why, how, and to what extent practitioners have adopted DOA to implement and deploy ML-based systems.} We overcome the knowledge gap by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Cloud Computing and Resource Management · Big Data and Business Intelligence
