Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation

Yosuke Yamagishi; Shouhei Hanaoka

arXiv:2410.10710·cs.CV·October 16, 2024

TL;DR

This paper presents an ensemble of ConvNeXt V2 and MaxViT models, pretrained on an external chest X-ray dataset, combined with asymmetric loss and view-based prediction aggregation for long-tailed CXR classification. The approach placed 4th in Subtask 2 and 5th in Subtask 1 of the MICCAI 2024 CXR-LT challenge.

Contribution

It combines a ConvNeXt V2 and MaxViT ensemble with view-based prediction aggregation and asymmetric loss for class imbalance, and reports improved detection accuracy on long-tailed CXR findings.

Findings

01. Achieved 4th place in Subtask 2 and 5th place in Subtask 1 of the MICCAI 2024 CXR-LT challenge.

02. The ensemble of ConvNeXt V2 and MaxViT improves classification performance.

03. View-based aggregation and asymmetric loss enhance detection of rare findings.
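
To make the role of asymmetric loss concrete, here is a minimal sketch assuming the widely used multi-label formulation, in which negatives are down-weighted by a focusing exponent and a probability margin. The function name `asymmetric_loss` and the default values of `gamma_pos`, `gamma_neg`, and `margin` are illustrative assumptions, not settings reported on this page; the authors' repository is the reference for what was actually used.

```python
import math


def asymmetric_loss(probs, targets, gamma_pos=0.0, gamma_neg=4.0,
                    margin=0.05, eps=1e-8):
    """Sum of per-label asymmetric losses for one multi-label sample.

    probs   : predicted probabilities in [0, 1], one per label
    targets : 0/1 ground-truth labels, same length as probs
    All hyperparameter defaults are illustrative assumptions.
    """
    total = 0.0
    for p, y in zip(probs, targets):
        if y == 1:
            # Positive term: focal-style weighting with exponent gamma_pos.
            total += -((1.0 - p) ** gamma_pos) * math.log(max(p, eps))
        else:
            # Negative term: shift the probability down by the margin so
            # very easy negatives contribute nothing, then apply a stronger
            # focusing exponent gamma_neg.
            p_m = max(p - margin, 0.0)
            total += -(p_m ** gamma_neg) * math.log(max(1.0 - p_m, eps))
    return total


if __name__ == "__main__":
    # The same prediction of 0.5 costs far more when the label is positive,
    # which keeps rare positive findings from being swamped by negatives.
    print(asymmetric_loss([0.5], [1]))
    print(asymmetric_loss([0.5], [0]))
```

With `gamma_pos=0`, `gamma_neg=0`, and `margin=0`, this reduces to ordinary binary cross-entropy, which is one way to sanity-check an implementation.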

Abstract

In this work, we present our solution for the MICCAI 2024 CXR-LT challenge, achieving 4th place in Subtask 2 and 5th in Subtask 1. We leveraged an ensemble of ConvNeXt V2 and MaxViT models, pretrained on an external chest X-ray dataset, to address the long-tailed distribution of chest findings. The proposed method combines state-of-the-art image classification techniques, asymmetric loss for handling class imbalance, and view-based prediction aggregation to enhance classification performance. Through experiments, we demonstrate the advantages of our approach in improving both detection accuracy and the handling of the long-tailed distribution in CXR findings. The code is available at https://github.com/yamagishi0824/cxrlt24-multiview-pp.
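
The abstract does not spell out how ensembling and view-based aggregation are computed, so the following is only a sketch under stated assumptions: model outputs are combined by a plain mean of per-class probabilities, and images in a study are averaged within each view before the views are averaged together. The helper names `mean_probs` and `aggregate_study`, the view labels, and the equal weighting are all hypothetical and may differ from the authors' code.

```python
from collections import defaultdict


def mean_probs(prob_lists):
    """Element-wise mean of several equally long probability lists."""
    n = len(prob_lists)
    return [sum(col) / n for col in zip(*prob_lists)]


def aggregate_study(image_preds):
    """Combine image-level predictions into one study-level prediction.

    image_preds : list of (view, probs) pairs for a single study, where
                  view is a label such as "frontal" or "lateral" (assumed).
    Images are averaged within each view first, then across views, so a
    study with many frontal images is not dominated by that view.
    """
    by_view = defaultdict(list)
    for view, probs in image_preds:
        by_view[view].append(probs)
    per_view = [mean_probs(p) for p in by_view.values()]
    return mean_probs(per_view)


if __name__ == "__main__":
    # Hypothetical two-class outputs of the two models for one image.
    convnext_probs = [0.25, 0.5]
    maxvit_probs = [0.75, 0.5]
    print(mean_probs([convnext_probs, maxvit_probs]))

    # Hypothetical ensembled predictions for three images of one study.
    study = [
        ("frontal", [1.0, 0.0]),
        ("frontal", [0.5, 0.0]),
        ("lateral", [0.25, 1.0]),
    ]
    print(aggregate_study(study))
```

Averaging within a view before averaging across views is a design choice made here for illustration; a flat mean over all images of a study, or view-specific weights, would be equally plausible readings of "view-based aggregation".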

Code & Models

Repositories

yamagishi0824/cxrlt24-multiview-pp (PyTorch, official)

Taxonomy

Topics: Image Processing Techniques and Applications · Machine Learning in Bioinformatics · Chemokine receptors and signaling

Methods: ConvNeXt