Pixel VQ-VAEs for Improved Pixel Art Representation

Akash Saravanan; Matthew Guzdial

arXiv:2203.12130·cs.CV·September 23, 2022·5 cites

Pixel VQ-VAEs for Improved Pixel Art Representation

Akash Saravanan, Matthew Guzdial

PDF

Open Access 1 Repo

TL;DR

This paper introduces Pixel VQ-VAE, a specialized model designed to effectively learn and represent pixel art, outperforming existing models in embedding quality and downstream task performance.

Contribution

The paper presents a novel Pixel VQ-VAE model tailored for pixel art, addressing the limitations of traditional models in capturing individual pixel importance.

Findings

01

Outperforms other models in embedding quality

02

Achieves better performance on downstream tasks

03

Effectively captures pixel-level details in pixel art

Abstract

Machine learning has had a great deal of success in image processing. However, the focus of this work has largely been on realistic images, ignoring more niche art styles such as pixel art. Additionally, many traditional machine learning models that focus on groups of pixels do not work well with pixel art, where individual pixels are important. We propose the Pixel VQ-VAE, a specialized VQ-VAE model that learns representations of pixel art. We show that it outperforms other models in both the quality of embeddings as well as performance on downstream tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

akashsara/fusion-dance
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAesthetic Perception and Analysis · Visual Attention and Saliency Detection · Generative Adversarial Networks and Image Synthesis

MethodsVQ-VAE