Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions
David Yevick

TL;DR
This paper introduces a modification to variational autoencoders that improves their ability to recognize and classify rotated and distorted digits by using fixed reference outputs during training.
Contribution
The paper proposes a simple change to the VAE objective function, enabling better discrimination of rotated digits even with low-dimensional latent spaces.
Findings
Enhanced digit classification accuracy for rotated MNIST digits
Effective discrimination with low-dimensional latent spaces
Improved robustness to distortions in digit recognition
Abstract
This paper demonstrates that a simple modification of the variational autoencoder (VAE) formalism enables the method to identify and classify rotated and distorted digits. In particular, the conventional objective (cost) function employed during the training process of a VAE both quantifies the agreement between the input and output data records and ensures that the latent space representation of the input data record is statistically generated with an appropriate mean and standard deviation. After training, simulated data realizations are generated by decoding appropriate latent space points. Since, however, standard VAE:s trained on randomly rotated MNIST digits cannot reliably distinguish between different digit classes since the rotated input data is effectively compared to a similarly rotated output data record. In contrast, an alternative implementation in which the objective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Neural Networks and Applications
