TL;DR
This paper introduces class-informed autoencoders that incorporate label information into the loss function to effectively reduce data complexity, improving feature learning for classification tasks.
Contribution
It proposes three novel autoencoder-based methods—Scorer, Skaler, and Slicer—that leverage class labels to enhance feature extraction for complex data.
Findings
Outperforms four popular unsupervised feature extraction techniques.
Effective in reducing data complexity for classification.
Validated on 27 datasets with diverse complexity metrics.
Abstract
Available data in machine learning applications is becoming increasingly complex, due to higher dimensionality and difficult classes. There exists a wide variety of approaches to measuring complexity of labeled data, according to class overlap, separability or boundary shapes, as well as group morphology. Many techniques can transform the data in order to find better features, but few focus on specifically reducing data complexity. Most data transformation methods mainly treat the dimensionality aspect, leaving aside the available information within class labels which can be useful when classes are somehow complex. This paper proposes an autoencoder-based approach to complexity reduction, using class labels in order to inform the loss function about the adequacy of the generated variables. This leads to three different new feature learners, Scorer, Skaler and Slicer. They are based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
