Latent Transformations for Object View Points Synthesis

Sangpil Kim; Nick Winovich; Guang Lin; Karthik Ramani

arXiv:1807.04812·cs.CV·November 29, 2018

Latent Transformations for Object View Points Synthesis

Sangpil Kim, Nick Winovich, Guang Lin, Karthik Ramani

PDF

Open Access

TL;DR

This paper introduces LTNN, a lightweight, fully-convolutional neural network for real-time object view synthesis, utilizing a novel conditional transformation unit and consistency loss to outperform existing methods across multiple tasks.

Contribution

The paper presents a new latent transformation neural network with a dedicated transformation unit and a consistency loss, improving view synthesis quality and efficiency.

Findings

01

Outperforms state-of-the-art in multi-view reconstruction, face view synthesis, and object rotation.

02

Reduces inference computational demand by 30%.

03

Demonstrates versatility across diverse 3D view synthesis tasks.

Abstract

We propose a fully-convolutional conditional generative model, the latent transformation neural network (LTNN), capable of view synthesis using a light-weight neural network suited for real-time applications. In contrast to existing conditional generative models which incorporate conditioning information via concatenation, we introduce a dedicated network component, the conditional transformation unit (CTU), designed to learn the latent space transformations corresponding to specified target views. In addition, a consistency loss term is defined to guide the network toward learning the desired latent space mappings, a task-divided decoder is constructed to refine the quality of generated views, and an adaptive discriminator is introduced to improve the adversarial training process. The generality of the proposed methodology is demonstrated on a collection of three diverse tasks:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · 3D Shape Modeling and Analysis