Embracing Massive Medical Data
Yu-Cheng Chou, Zongwei Zhou, Alan Yuille

TL;DR
This paper introduces an online learning approach for massive medical data that enhances training efficiency and reduces catastrophic forgetting by selecting the most significant data samples based on their uniqueness and uncertainty, improving segmentation accuracy.
Contribution
The paper presents a novel online learning method that selectively trains on significant data samples, addressing challenges of data volume and continual learning in medical imaging.
Findings
Improves Dice score by 15% in segmentation tasks.
Enhances data efficiency in continual training.
Mitigates catastrophic forgetting effectively.
Abstract
As massive medical data become available with an increasing number of scans, expanding classes, and varying sources, prevalent training paradigms -- where AI is trained with multiple passes over fixed, finite datasets -- face significant challenges. First, training AI all at once on such massive data is impractical as new scans/sources/classes continuously arrive. Second, training AI continuously on new scans/sources/classes can lead to catastrophic forgetting, where AI forgets old data as it learns new data, and vice versa. To address these two challenges, we propose an online learning method that enables training AI from massive medical data. Instead of repeatedly training AI on randomly selected data samples, our method identifies the most significant samples for the current AI model based on their data uniqueness and prediction uncertainty, then trains the AI on these selective data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare
