Latent Transformations for Object View Points Synthesis
Sangpil Kim, Nick Winovich, Guang Lin, Karthik Ramani

TL;DR
This paper introduces LTNN, a lightweight, fully-convolutional neural network for real-time object view synthesis, utilizing a novel conditional transformation unit and consistency loss to outperform existing methods across multiple tasks.
Contribution
The paper presents a new latent transformation neural network with a dedicated transformation unit and a consistency loss, improving view synthesis quality and efficiency.
Findings
Outperforms state-of-the-art in multi-view reconstruction, face view synthesis, and object rotation.
Reduces inference computational demand by 30%.
Demonstrates versatility across diverse 3D view synthesis tasks.
Abstract
We propose a fully-convolutional conditional generative model, the latent transformation neural network (LTNN), capable of view synthesis using a light-weight neural network suited for real-time applications. In contrast to existing conditional generative models which incorporate conditioning information via concatenation, we introduce a dedicated network component, the conditional transformation unit (CTU), designed to learn the latent space transformations corresponding to specified target views. In addition, a consistency loss term is defined to guide the network toward learning the desired latent space mappings, a task-divided decoder is constructed to refine the quality of generated views, and an adaptive discriminator is introduced to improve the adversarial training process. The generality of the proposed methodology is demonstrated on a collection of three diverse tasks:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · 3D Shape Modeling and Analysis
