Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

Harsh Patel; Buvaneswari A. Ramanan; Manzoor A. Khan; Thomas Williams,; Brian Friedman; Lawrence Drabeck

arXiv:2405.06835·cs.LG·May 14, 2024·1 cites

Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

Harsh Patel, Buvaneswari A. Ramanan, Manzoor A. Khan, Thomas Williams,, Brian Friedman, Lawrence Drabeck

PDF

Open Access

TL;DR

This study benchmarks the ability of GPT-3.5-turbo and WizardCoder models to automate MLOps functionalities in ML code, demonstrating GPT-3.5-turbo's superior performance across various tasks such as code adaptation and translation.

Contribution

It introduces a benchmarking framework for evaluating LLMs on MLOps tasks and shows GPT-3.5-turbo's significant advantages over WizardCoder in these functionalities.

Findings

01

GPT-3.5-turbo achieves 55% Pass@3 in model optimization.

02

GPT-3.5-turbo attains 100% accuracy in experiment tracking.

03

GPT-3.5-turbo outperforms WizardCoder in multiple MLOps tasks.

Abstract

This paper explores the possibilities of the current generation of Large Language Models for incorporating Machine Learning Operations (MLOps) functionalities into ML training code bases. We evaluate the performance of OpenAI (gpt-3.5-turbo) and WizardCoder (open-source, 15B parameters) models on the automated accomplishment of various MLOps functionalities in different settings. We perform a benchmarking study that assesses the ability of these models to: (1) adapt existing code samples (Inlining) with component-specific MLOps functionality such as MLflow and Weights & Biases for experiment tracking, Optuna for hyperparameter optimization etc., and (2) perform the task of Translation from one component of an MLOps functionality to another, e.g., translating existing GitPython library based version control code to Data Version Control library based. We also propose three different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Digital Rights Management and Security

MethodsLib