Reproducibility in Machine Learning for Health

Matthew B.A. McDermott (1); Shirly Wang (2); Nikki Marinsek (3),; Rajesh Ranganath (4); Marzyeh Ghassemi (2; 5); Luca Foschini (3) ((1); Massachusetts Institute of Technology; (2) University of Toronto; (3); Evidation Health; Inc.; (4) New York University; (5) Vector Institute)

arXiv:1907.01463·cs.LG·July 3, 2019·29 cites

Reproducibility in Machine Learning for Health

Matthew B.A. McDermott (1), Shirly Wang (2), Nikki Marinsek (3),, Rajesh Ranganath (4), Marzyeh Ghassemi (2, 5), Luca Foschini (3) ((1), Massachusetts Institute of Technology, (2) University of Toronto, (3), Evidation Health, Inc., (4) New York University, (5) Vector Institute)

PDF

Open Access

TL;DR

This paper systematically evaluates reproducibility issues in machine learning for health, highlighting poor data and code accessibility, and proposes recommendations to improve research transparency and reliability.

Contribution

It provides a comprehensive assessment of reproducibility in ML4H research and offers actionable guidelines to enhance transparency and reproducibility in the field.

Findings

01

ML4H compares poorly to established ML fields in reproducibility.

02

Data and code accessibility are significantly lacking in ML4H papers.

03

Recommendations are proposed to improve reproducibility in the field.

Abstract

Machine learning algorithms designed to characterize, monitor, and intervene on human health (ML4H) are expected to perform safely and reliably when operating at scale, potentially outside strict human supervision. This requirement warrants a stricter attention to issues of reproducibility than other fields of machine learning. In this work, we conduct a systematic evaluation of over 100 recently published ML4H research papers along several dimensions related to reproducibility. We find that the field of ML4H compares poorly to more established machine learning fields, particularly concerning data and code accessibility. Finally, drawing from success in other fields of science, we propose recommendations to data providers, academic publishers, and the ML4H research community in order to promote reproducible research moving forward.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education