Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder
Cl\'ement Chadebec, Elina Thibeau-Sutre, Ninon Burgos, St\'ephanie, Allassonni\`ere

TL;DR
This paper introduces a geometry-based variational autoencoder for data augmentation in high-dimensional, low-sample-size scenarios, significantly improving classification accuracy especially in medical imaging with small datasets.
Contribution
It presents a novel VAE approach that models the latent space as a Riemannian manifold and a new sample generation scheme, enhancing data augmentation reliability in HDLSS settings.
Findings
Improves classification accuracy on small datasets
Enhances robustness across classifiers and data sizes
Validates effectiveness on medical imaging data
Abstract
In this paper, we propose a new method to perform data augmentation in a reliable way in the High Dimensional Low Sample Size (HDLSS) setting using a geometry-based variational autoencoder. Our approach combines a proper latent space modeling of the VAE seen as a Riemannian manifold with a new generation scheme which produces more meaningful samples especially in the context of small data sets. The proposed method is tested through a wide experimental study where its robustness to data sets, classifiers and training samples size is stressed. It is also validated on a medical imaging classification task on the challenging ADNI database where a small number of 3D brain MRIs are considered and augmented using the proposed VAE framework. In each case, the proposed method allows for a significant and reliable gain in the classification metrics. For instance, balanced accuracy jumps from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · AI in cancer detection · Medical Image Segmentation Techniques
