Evaluating the Stability of Deep Learning Latent Feature Spaces
Ademide O. Mabadeje, Michael J. Pyrcz

TL;DR
This paper presents a comprehensive workflow to evaluate the stability of deep learning-derived latent feature spaces, addressing their invariance to data, model, and parameter variations, with applications across synthetic and real datasets.
Contribution
It introduces a novel methodology with specific metrics for assessing latent space stability, including three stability types and new indicators like adjusted stress and Jaccard dissimilarity.
Findings
Latent spaces exhibit inherent instabilities.
The workflow effectively quantifies and interprets these instabilities.
Stability assessment improves model interpretability and decision-making.
Abstract
High-dimensional datasets present substantial challenges in statistical modeling across various disciplines, necessitating effective dimensionality reduction methods. Deep learning approaches, notable for their capacity to distill essential features from complex data, facilitate modeling, visualization, and compression through reduced dimensionality latent feature spaces, have wide applications from bioinformatics to earth sciences. This study introduces a novel workflow to evaluate the stability of these latent spaces, ensuring consistency and reliability in subsequent analyses. Stability, defined as the invariance of latent spaces to minor data, training realizations, and parameter perturbations, is crucial yet often overlooked. Our proposed methodology delineates three stability types, sample, structural, and inferential, within latent spaces, and introduces a suite of metrics for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Topic Modeling
Methodsk-Means Clustering
