Dataset-Adaptive Dimensionality Reduction
Hyeon Jeon, Jeongin Park, Soohyun Lee, Dae Hyun Kim, Sungbok Shin, Jinwook Seo

TL;DR
This paper introduces a dataset-adaptive method for optimizing dimensionality reduction by using structural complexity metrics to predict the necessary embedding dimensions, reducing trial-and-error and computational costs.
Contribution
It proposes a novel set of structural complexity metrics that guide the selection of optimal DR techniques and hyperparameters based on dataset complexity.
Findings
Metrics accurately approximate dataset complexity
The adaptive workflow reduces optimization trials
Efficiency improves without loss of accuracy
Abstract
Selecting the appropriate dimensionality reduction (DR) technique and determining its optimal hyperparameter settings that maximize the accuracy of the output projections typically involves extensive trial and error, often resulting in unnecessary computational overhead. To address this challenge, we propose a dataset-adaptive approach to DR optimization guided by structural complexity metrics. These metrics quantify the intrinsic complexity of a dataset, predicting whether higher-dimensional spaces are necessary to represent it accurately. Since complex datasets are often inaccurately represented in two-dimensional projections, leveraging these metrics enables us to predict the maximum achievable accuracy of DR techniques for a given dataset, eliminating redundant trials in optimizing DR. We introduce the design and theoretical foundations of these structural complexity metrics. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
