Push it to the Limit: Discover Edge-Cases in Image Data with   Autoencoders

Ilja Manakov; Volker Tresp

arXiv:1910.02713·cs.CV·October 8, 2019

Push it to the Limit: Discover Edge-Cases in Image Data with Autoencoders

Ilja Manakov, Volker Tresp

PDF

Open Access 1 Repo

TL;DR

This paper presents a method using autoencoders and PCA to identify semantic factors and edge-cases in large image datasets by analyzing high-level encodings and their principal components.

Contribution

It introduces a novel approach combining autoencoders and PCA to discover semantic variations and edge-cases in image data.

Findings

01

High and low ends of principal component distributions correspond to specific semantic groups.

02

The method effectively uncovers unwanted edge-cases in real-world datasets.

03

Autoencoders facilitate high-level feature extraction for semantic analysis.

Abstract

In this paper, we focus on the problem of identifying semantic factors of variation in large image datasets. By training a convolutional Autoencoder on the image data, we create encodings, which describe each datapoint at a higher level of abstraction than pixel-space. We then apply Principal Component Analysis to the encodings to disentangle the factors of variation in the data. Sorting the dataset according to the values of individual principal components, we find that samples at the high and low ends of the distribution often share specific semantic characteristics. We refer to these groups of samples as semantic groups. When applied to real-world data, this method can help discover unwanted edge-cases.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IljaManakov/PushItToTheLimit
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Machine Learning and Data Classification · Advanced Image and Video Retrieval Techniques

MethodsSolana Customer Service Number +1-833-534-1729