Practices for Engineering Trustworthy Machine Learning Applications
Alex Serban, Koen van der Blom, Holger Hoos, Joost Visser

TL;DR
This paper identifies and extends a set of operational practices for developing trustworthy ML systems, based on literature review and a global survey, highlighting low adoption levels especially in security practices.
Contribution
It translates high-level trustworthy ML guidelines into 14 concrete, actionable practices and extends an existing ML engineering practices catalogue.
Findings
Low adoption of security practices in ML development
Moderate adoption of explanation and transparency practices
Extended practice catalogue available for community review
Abstract
Following the recent surge in adoption of machine learning (ML), the negative impact that improper use of ML can have on users and society is now also widely recognised. To address this issue, policy makers and other stakeholders, such as the European Commission or NIST, have proposed high-level guidelines aiming to promote trustworthy ML (i.e., lawful, ethical and robust). However, these guidelines do not specify actions to be taken by those involved in building ML systems. In this paper, we argue that guidelines related to the development of trustworthy ML can be translated to operational practices, and should become part of the ML development life cycle. Towards this goal, we ran a multi-vocal literature review, and mined operational practices from white and grey literature. Moreover, we launched a global survey to measure practice adoption and the effects of these practices. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
