ZACH-ViT: A Zero-Token Vision Transformer with ShuffleStrides Data Augmentation for Robust Lung Ultrasound Classification
Athanasios Angelakis, Amne Mousa, Micah L. A. Heldeweg, Laurens A. Biesheuvel, Mark A. Haaksma, Jasper M. Smit, Pieter R. Tuinman, Paul W. G. Elbers

TL;DR
ZACH-ViT is a novel zero-token vision transformer designed for robust lung ultrasound classification, utilizing permutation-invariant architecture and a new data augmentation method, achieving superior accuracy and efficiency on heterogeneous clinical data.
Contribution
The paper introduces ZACH-ViT, a fully permutation-invariant vision transformer without positional embeddings or [CLS] token, and ShuffleStrides Data Augmentation for improved medical image classification.
Findings
Achieved highest ROC-AUC of 0.79 on lung ultrasound data.
Outperformed nine state-of-the-art baselines.
Trained 1.35x faster with fewer parameters.
Abstract
Differentiating cardiogenic pulmonary oedema (CPE) from non-cardiogenic and structurally normal lungs in lung ultrasound (LUS) videos remains challenging due to the high visual variability of non-cardiogenic inflammatory patterns (NCIP/ARDS-like), interstitial lung disease, and healthy lungs. This heterogeneity complicates automated classification as overlapping B-lines and pleural artefacts are common. We introduce ZACH-ViT (Zero-token Adaptive Compact Hierarchical Vision Transformer), a 0.25 M-parameter Vision Transformer variant that removes both positional embeddings and the [CLS] token, making it fully permutation-invariant and suitable for unordered medical image data. To enhance generalization, we propose ShuffleStrides Data Augmentation (SSDA), which permutes probe-view sequences and frame orders while preserving anatomical validity. ZACH-ViT was evaluated on 380 LUS videos from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUltrasound in Clinical Applications · Phonocardiography and Auscultation Techniques · COVID-19 diagnosis using AI
