Rotated Digit Recognition by Variational Autoencoders with Fixed Output   Distributions

David Yevick

arXiv:2206.13388·cs.CV·June 29, 2022

Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions

David Yevick

PDF

Open Access

TL;DR

This paper introduces a modification to variational autoencoders that improves their ability to recognize and classify rotated and distorted digits by using fixed reference outputs during training.

Contribution

The paper proposes a simple change to the VAE objective function, enabling better discrimination of rotated digits even with low-dimensional latent spaces.

Findings

01

Enhanced digit classification accuracy for rotated MNIST digits

02

Effective discrimination with low-dimensional latent spaces

03

Improved robustness to distortions in digit recognition

Abstract

This paper demonstrates that a simple modification of the variational autoencoder (VAE) formalism enables the method to identify and classify rotated and distorted digits. In particular, the conventional objective (cost) function employed during the training process of a VAE both quantifies the agreement between the input and output data records and ensures that the latent space representation of the input data record is statistically generated with an appropriate mean and standard deviation. After training, simulated data realizations are generated by decoding appropriate latent space points. Since, however, standard VAE:s trained on randomly rotated MNIST digits cannot reliably distinguish between different digit classes since the rotated input data is effectively compared to a similarly rotated output data record. In contrast, an alternative implementation in which the objective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Neural Networks and Applications