Towards Trustworthy Machine Learning in Production: An Overview of the Robustness in MLOps Approach
Firas Bayram, Bestoun S. Ahmed

TL;DR
This paper provides a comprehensive overview of robust MLOps practices, tools, and research approaches to enhance the trustworthiness and reliability of machine learning systems in real-world production environments.
Contribution
It surveys existing research, tools, and challenges related to robustness in MLOps, offering guidance for deploying trustworthy ML solutions in practice.
Findings
Identifies key technical practices for robust MLOps
Reviews tools supporting robustness in MLOps systems
Highlights open challenges and future research directions
Abstract
Artificial intelligence (AI), and especially its sub-field of Machine Learning (ML), are impacting the daily lives of everyone with their ubiquitous applications. In recent years, AI researchers and practitioners have introduced principles and guidelines to build systems that make reliable and trustworthy decisions. From a practical perspective, conventional ML systems process historical data to extract the features that are consequently used to train ML models that perform the desired task. However, in practice, a fundamental challenge arises when the system needs to be operationalized and deployed to evolve and operate in real-life environments continuously. To address this challenge, Machine Learning Operations (MLOps) have emerged as a potential recipe for standardizing ML solutions in deployment. Although MLOps demonstrated great success in streamlining ML processes, thoroughly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
