How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?
Gregory Holste, Ziyu Jiang, Ajay Jaiswal, Maria Hanna, Shlomo, Minkowitz, Alan C. Legasto, Joanna G. Escalon, Sharon Steinberger, Mark, Bittman, Thomas C. Shen, Ying Ding, Ronald M. Summers, George Shih, Yifan, Peng, and Zhangyang Wang

TL;DR
This paper investigates how pruning affects the performance and behavior of deep neural networks in diagnosing thorax diseases from chest X-rays, especially focusing on long-tailed, multi-label datasets in medical imaging.
Contribution
It provides the first analysis of pruning's impact on multi-label medical image classifiers, identifying disease-specific effects and human-perceived differences in pruned versus unpruned models.
Findings
Pruning affects certain diseases more than others, influenced by disease frequency and co-occurrence.
Radiologists perceive pruning-identified exemplars as noisier and more difficult to diagnose.
Pruned models show specific patterns of disagreement with uncompressed models, highlighting potential risks in clinical deployment.
Abstract
Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance. However, the nuanced ways in which pruning impacts model behavior are not well understood, particularly for long-tailed, multi-label datasets commonly found in clinical settings. This knowledge gap could have dangerous implications when deploying a pruned model for diagnosis, where unexpected model behavior could impact patient well-being. To fill this gap, we perform the first analysis of pruning's effect on neural networks trained to diagnose thorax diseases from chest X-rays (CXRs). On two large CXR datasets, we examine which diseases are most affected by pruning and characterize class "forgettability" based on disease frequency and co-occurrence behavior. Further, we identify individual CXRs where uncompressed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · Machine Learning in Healthcare
MethodsPruning
