Reducing Data Complexity using Autoencoders with Class-informed Loss   Functions

David Charte; Francisco Charte; Francisco Herrera

arXiv:2111.06142·cs.LG·November 12, 2021

Reducing Data Complexity using Autoencoders with Class-informed Loss Functions

David Charte, Francisco Charte, Francisco Herrera

PDF

1 Repo

TL;DR

This paper introduces class-informed autoencoders that incorporate label information into the loss function to effectively reduce data complexity, improving feature learning for classification tasks.

Contribution

It proposes three novel autoencoder-based methods—Scorer, Skaler, and Slicer—that leverage class labels to enhance feature extraction for complex data.

Findings

01

Outperforms four popular unsupervised feature extraction techniques.

02

Effective in reducing data complexity for classification.

03

Validated on 27 datasets with diverse complexity metrics.

Abstract

Available data in machine learning applications is becoming increasingly complex, due to higher dimensionality and difficult classes. There exists a wide variety of approaches to measuring complexity of labeled data, according to class overlap, separability or boundary shapes, as well as group morphology. Many techniques can transform the data in order to find better features, but few focus on specifically reducing data complexity. Most data transformation methods mainly treat the dimensionality aspect, leaving aside the available information within class labels which can be useful when classes are somehow complex. This paper proposes an autoencoder-based approach to complexity reduction, using class labels in order to inform the loss function about the adequacy of the generated variables. This leads to three different new feature learners, Scorer, Skaler and Slicer. They are based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ari-dasci/S-reducing-complexity
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.