Datasheets for Healthcare AI: A Framework for Transparency and Bias Mitigation
Marjia Siddik, Harshvardhan J. Pandit

TL;DR
This paper introduces the 'Healthcare AI Datasheet', a framework for transparent dataset documentation that aims to mitigate bias and improve ethical AI practices in healthcare by enabling better recognition of dataset issues and supporting regulatory compliance.
Contribution
It proposes a novel, machine-readable datasheet framework for healthcare datasets that enhances transparency, addresses bias, and supports automated risk assessments, filling gaps in existing documentation practices.
Findings
The datasheet framework improves recognition of dataset biases.
It facilitates regulatory compliance and ethical AI development.
Machine-readable format enables automated risk assessments.
Abstract
The use of AI in healthcare has the potential to improve patient care, optimize clinical workflows, and enhance decision-making. However, bias, data incompleteness, and inaccuracies in training datasets can lead to unfair outcomes and amplify existing disparities. This research investigates the current state of dataset documentation practices, focusing on their ability to address these challenges and support ethical AI development. We identify shortcomings in existing documentation methods, which limit the recognition and mitigation of bias, incompleteness, and other issues in datasets. We propose the 'Healthcare AI Datasheet' to address these gaps, a dataset documentation framework that promotes transparency and ensures alignment with regulatory requirements. Additionally, we demonstrate how it can be expressed in a machine-readable format, facilitating its integration with datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence
