A Software Engineering Perspective on Engineering Machine Learning   Systems: State of the Art and Challenges

G\"orkem Giray

arXiv:2012.07919·cs.SE·June 16, 2021

A Software Engineering Perspective on Engineering Machine Learning Systems: State of the Art and Challenges

G\"orkem Giray

PDF

TL;DR

This paper systematically reviews the current state of software engineering research for machine learning systems, highlighting the challenges, lack of mature tools, and the need for further empirical studies in industrial contexts.

Contribution

It provides a comprehensive analysis of existing SE approaches for ML systems, identifying gaps and emphasizing the importance of empirical research and tool development.

Findings

01

Testing is the most researched SE aspect for ML systems.

02

No mature tools or techniques currently exist for most SE aspects.

03

Challenges of ML systems engineering are mainly identified through surveys and interviews.

Abstract

Context: Advancements in machine learning (ML) lead to a shift from the traditional view of software development, where algorithms are hard-coded by humans, to ML systems materialized through learning from data. Therefore, we need to revisit our ways of developing software systems and consider the particularities required by these new types of systems. Objective: The purpose of this study is to systematically identify, analyze, summarize, and synthesize the current state of software engineering (SE) research for engineering ML systems. Method: I performed a systematic literature review (SLR). I systematically selected a pool of 141 studies from SE venues and then conducted a quantitative and qualitative analysis using the data extracted from these studies. Results: The non-deterministic nature of ML systems complicates all SE aspects of engineering ML systems. Despite increasing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.