Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes
Tahir Syed, Ariba Khan, Sawera Hanif

TL;DR
This paper introduces a hybrid Matryoshka autoencoder-U-Net architecture that effectively segments high-resolution medical images, especially in low-data and GPU-limited settings, by preserving detailed pixel information.
Contribution
It presents a novel Matryoshka Autoencoder combined with U-Net for improved pixel-level segmentation in medical imaging, demonstrating superior performance over standard U-Net.
Findings
Achieves higher IoU and Dice scores than baseline U-Net.
Effectively segments echocardiographic images with low contrast.
Shows promise for low-resource medical image analysis.
Abstract
Medical images are often high-resolution and lose important detail if downsampled, making pixel-level methods such as semantic segmentation much less efficient if performed on a low-dimensional image. We propose a low-rank Matryoshka projection and a hybrid segmenting architecture that preserves important information while retaining sufficient pixel geometry for pixel-level tasks. We design the Matryoshka Autoencoder (MatAE-U-Net) which combines the hierarchical encoding of the Matryoshka Autoencoder with the spatial reconstruction capabilities of a U-Net decoder, leveraging multi-scale feature extraction and skip connections to enhance accuracy and generalisation. We apply it to the problem of segmenting the left ventricle (LV) in echocardiographic images using the Stanford EchoNet-D dataset, including 1,000 standardised video-mask pairs of cardiac ultrasound videos resized to 112x112…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Medical Image Segmentation Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Max Pooling · Convolution · U-Net
