Increasing diversity of omni-directional images generated from single   image using cGAN based on MLPMixer

Atsuya Nakata; Ryuto Miyazaki; Takao Yamanaka

arXiv:2309.08129·eess.IV·September 18, 2023

Increasing diversity of omni-directional images generated from single image using cGAN based on MLPMixer

Atsuya Nakata, Ryuto Miyazaki, Takao Yamanaka

PDF

Open Access 1 Repo

TL;DR

This paper introduces a MLPMixer-based method for generating diverse omni-directional images from a single photo, overcoming CNN limitations by capturing long-range dependencies and reducing memory use.

Contribution

The paper presents a novel MLPMixer-based approach for omni-directional image generation, improving diversity and efficiency over traditional CNN-based methods.

Findings

01

Achieved higher diversity in generated images.

02

Reduced memory and computational requirements.

03

Competitive performance compared to CNN-based models.

Abstract

This paper proposes a novel approach to generating omni-directional images from a single snapshot picture. The previous method has relied on the generative adversarial networks based on convolutional neural networks (CNN). Although this method has successfully generated omni-directional images, CNN has two drawbacks for this task. First, since a convolutional layer only processes a local area, it is difficult to propagate the information of an input snapshot picture embedded in the center of the omni-directional image to the edges of the image. Thus, the omni-directional images created by the CNN-based generator tend to have less diversity at the edges of the generated images, creating similar scene images. Second, the CNN-based model requires large video memory in graphics processing units due to the nature of the deep structure in CNN since shallow-layer networks only receives signals…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

islab-sophia/odigen-mlpmixer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Image and Video Stabilization