An Efficient Model Maintenance Approach for MLOps
Forough Majidi, Foutse Khomh, Heng Li, Amin Nikanjam

TL;DR
This paper introduces an efficient MLOps pipeline with a novel model reuse approach and a tool called SimReuse, which leverages recurrent data patterns to reduce retraining costs and maintain model performance over time.
Contribution
It presents a new model maintenance method that reuses models based on data similarity, integrated into an improved MLOps pipeline and supported by the SimReuse tool.
Findings
Reduces model maintenance time and costs to 1/8th of traditional methods.
Maintains model performance comparable to state-of-the-art baselines.
Effectively leverages recurrent data patterns in time series datasets.
Abstract
In recent years, many industries have utilized machine learning (ML) models in their systems. Ideally, ML models should be trained on and applied to data from the same distributions. However, the data evolves over time in many application areas, leading to concept drift, which in turn causes the performance of the ML models to degrade over time. Therefore, maintaining up-to-date ML models plays a critical role in the MLOps pipeline. Existing ML model maintenance approaches are often computationally resource-intensive, costly, time-consuming, and model-dependent. Thus, we propose an improved MLOps pipeline, a new model maintenance approach and a Similarity-Based Model Reuse (SimReuse) tool to address the challenges of ML model maintenance. We identify seasonal and recurrent data distribution patterns in time series datasets throughout a preliminary study. Recurrent data distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies
