How Does Pruning Impact Long-Tailed Multi-Label Medical Image   Classifiers?

Gregory Holste; Ziyu Jiang; Ajay Jaiswal; Maria Hanna; Shlomo; Minkowitz; Alan C. Legasto; Joanna G. Escalon; Sharon Steinberger; Mark; Bittman; Thomas C. Shen; Ying Ding; Ronald M. Summers; George Shih; Yifan; Peng; and Zhangyang Wang

arXiv:2308.09180·cs.CV·July 31, 2024

How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?

Gregory Holste, Ziyu Jiang, Ajay Jaiswal, Maria Hanna, Shlomo, Minkowitz, Alan C. Legasto, Joanna G. Escalon, Sharon Steinberger, Mark, Bittman, Thomas C. Shen, Ying Ding, Ronald M. Summers, George Shih, Yifan, Peng, and Zhangyang Wang

PDF

Open Access 1 Repo

TL;DR

This paper investigates how pruning affects the performance and behavior of deep neural networks in diagnosing thorax diseases from chest X-rays, especially focusing on long-tailed, multi-label datasets in medical imaging.

Contribution

It provides the first analysis of pruning's impact on multi-label medical image classifiers, identifying disease-specific effects and human-perceived differences in pruned versus unpruned models.

Findings

01

Pruning affects certain diseases more than others, influenced by disease frequency and co-occurrence.

02

Radiologists perceive pruning-identified exemplars as noisier and more difficult to diagnose.

03

Pruned models show specific patterns of disagreement with uncompressed models, highlighting potential risks in clinical deployment.

Abstract

Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance. However, the nuanced ways in which pruning impacts model behavior are not well understood, particularly for long-tailed, multi-label datasets commonly found in clinical settings. This knowledge gap could have dangerous implications when deploying a pruned model for diagnosis, where unexpected model behavior could impact patient well-being. To fill this gap, we perform the first analysis of pruning's effect on neural networks trained to diagnose thorax diseases from chest X-rays (CXRs). On two large CXR datasets, we examine which diseases are most affected by pruning and characterize class "forgettability" based on disease frequency and co-occurrence behavior. Further, we identify individual CXRs where uncompressed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vita-group/prunecxr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · Machine Learning in Healthcare

MethodsPruning