Momentum-Anchored Multi-Scale Fusion Model for Long-Tailed Chest X-Ray Classification
Duy Hoang Khuong, Duy Nguyen Huu, Ngu Huynh Cong Viet

TL;DR
This paper introduces a momentum-anchored multi-scale fusion model that stabilizes feature representations to improve long-tailed chest X-ray classification, especially for rare pathologies.
Contribution
It proposes a novel EMA-based momentum anchoring mechanism combined with multi-scale spatial fusion to enhance feature stability and classification performance on imbalanced medical datasets.
Findings
Achieves 0.8682 average AUC on ChestX-ray14 dataset.
Significantly improves detection of rare pathologies like Hernia and Pneumonia.
Outperforms existing state-of-the-art methods in long-tailed chest X-ray classification.
Abstract
Chest X-ray classification suffers from severe class imbalance where gradient updates bias toward majority classes, causing feature drift and poor performance on rare but critical pathologies. We propose a Momentum-Anchored Multi-Scale Fusion Network that uses exponential moving averages (EMA) as a temporal anchoring mechanism to stabilize feature representations under long-tailed distributions. Our approach applies selective momentum updates to the final expansion block of an EfficientNet backbone, creating a slowly-evolving reference branch that resists gradient-induced drift while preserving discriminative patterns for minority classes. Combined with multi-scale spatial fusion (, , convolutions), this anchoring strategy maintains representational stability throughout training. On ChestX-ray14, our method achieves 0.8682 average AUC, outperforming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
