From PhysioNet to Foundation Models -- A History and Potential Futures
Gari D. Clifford

TL;DR
This paper reviews the evolution of medical data sharing, highlights PhysioNet's role in cardiology research, and discusses future challenges and opportunities in large-scale physiological data and AI models.
Contribution
It provides a historical overview of PhysioNet, identifies future directions for physiological databases, and discusses open access, AI, and reproducibility challenges.
Findings
PhysioNet has been central to cardiology data sharing for 25 years.
Open access and public challenges promote scientific progress.
Future directions include foundation models, edge computing, and sustainable AI.
Abstract
Over the last 35 years, the sharing of medical data and models for research has evolved from sneakernet to the internet - from mailing magnetic tapes and compact discs of a handful of well-curated recordings, to the high-speed download of relatively comprehensive hospital databases. More recently, the fervor around the potential for modern machine learning and 'AI' to catapult us into the next industrial revolution has led to a seemingly insatiable desire to pump almost any source of data into large models. Although this has great potential, it also presents a whole set of new challenges. In this article I examine these trends over the last 30 years, drawing on examples from cardiology, one of the oldest data-intensive fields that is undergoing a renaissance via machine learning. From the early days of computerized cardiology, the Research Resource for Complex Physiologic Signals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
