An investigation of challenges encountered when specifying training data and runtime monitors for safety critical ML applications
Hans-Martin Heyn, Eric Knauss, Iswarya Malleswaran, Shruthi, Dinakaran

TL;DR
This study explores the key challenges practitioners face when specifying training data and runtime monitors for safety-critical ML systems, based on interviews with industry experts, and offers recommendations to address these issues.
Contribution
It identifies 17 underlying challenges and their interconnections in specifying training data and runtime monitoring for critical ML applications, providing targeted recommendations.
Findings
17 challenges identified across 6 groups
Interconnections between challenges discovered
Recommendations proposed to address root causes
Abstract
Context and motivation: The development and operation of critical software that contains machine learning (ML) models requires diligence and established processes. Especially the training data used during the development of ML models have major influences on the later behaviour of the system. Runtime monitors are used to provide guarantees for that behaviour. Question / problem: We see major uncertainty in how to specify training data and runtime monitoring for critical ML models and by this specifying the final functionality of the system. In this interview-based study we investigate the underlying challenges for these difficulties. Principal ideas/results: Based on ten interviews with practitioners who develop ML models for critical applications in the automotive and telecommunication sector, we identified 17 underlying challenges in 6 challenge groups that relate to the challenge of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Testing and Debugging Techniques
