Collaboration Challenges in Building ML-Enabled Systems: Communication, Documentation, Engineering, and Process
Nadia Nahar, Shurui Zhou, Grace Lewis, Christian K\"astner

TL;DR
This paper identifies key collaboration challenges in building ML-enabled systems, highlighting issues in communication, documentation, engineering, and process, based on interviews with practitioners, and offers recommendations to address them.
Contribution
It provides an empirical analysis of collaboration challenges in ML system development and proposes practical recommendations for improvement.
Findings
Communication, documentation, engineering, and process are major challenges.
Teams face difficulties in requirements, data, and integration points.
Recommendations help improve collaboration and system quality.
Abstract
The introduction of machine learning (ML) components in software projects has created the need for software engineers to collaborate with data scientists and other specialists. While collaboration can always be challenging, ML introduces additional challenges with its exploratory model development process, additional skills and knowledge needed, difficulties testing ML systems, need for continuous evolution and monitoring, and non-traditional quality requirements such as fairness and explainability. Through interviews with 45 practitioners from 28 organizations, we identified key collaboration challenges that teams face when building and deploying ML systems into production. We report on common collaboration points in the development of production ML systems for requirements, data, and integration, as well as corresponding team patterns and challenges. We find that most of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Software Engineering Research · Data Quality and Management
