How Do Model Export Formats Impact the Development of ML-Enabled Systems? A Case Study on Model Integration
Shreyas Kumar Parida, Ilias Gerostathopoulos, Justus Bogner

TL;DR
This study evaluates how different ML model export formats affect system integration, highlighting ONNX as the most efficient for portability and ease of use, with specific insights into other formats' strengths and challenges.
Contribution
It provides empirical evidence on the impact of five popular ML export formats on system development, guiding practitioners in format selection based on integration efficiency and support.
Findings
ONNX offers the best integration and portability.
SavedModel and TorchScript are convenient in Python but require workarounds otherwise.
Pickle and Joblib are the most challenging to integrate.
Abstract
Machine learning (ML) models are often integrated into ML-enabled systems to provide software functionality that would otherwise be impossible. This integration requires the selection of an appropriate ML model export format, for which many options are available. These formats are crucial for ensuring a seamless integration, and choosing a suboptimal one can negatively impact system development. However, little evidence is available to guide practitioners during the export format selection. We therefore evaluated various model export formats regarding their impact on the development of ML-enabled systems from an integration perspective. Based on the results of a preliminary questionnaire survey (n=17), we designed an extensive embedded case study with two ML-enabled systems in three versions with different technologies. We then analyzed the effect of five popular export formats,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Business Process Modeling and Analysis · Simulation Techniques and Applications
