Image Semantic Transformation: Faster, Lighter and Stronger

Dasong Li; Jianbo Wang

arXiv:1803.09932·cs.CV·March 28, 2018

Image Semantic Transformation: Faster, Lighter and Stronger

Dasong Li, Jianbo Wang

PDF

Open Access

TL;DR

This paper introduces ISTRC, a novel model that uses FaceNet's Euclidean latent space to perform semantic image transformations and reconstructions efficiently, enabling high-level manipulations with minimal training time.

Contribution

The paper presents ISTRC, a new approach leveraging FaceNet's latent space for semantic image transformations and perfect reconstructions, with rapid training and high accuracy.

Findings

01

Performs 10 high-level semantic transformations

02

Achieves perfect image reconstruction within the circle

03

Requires only 3 hours of training on GTX 1080

Abstract

We propose Image-Semantic-Transformation-Reconstruction-Circle(ISTRC) model, a novel and powerful method using facenet's Euclidean latent space to understand the images. As the name suggests, ISTRC construct the circle, able to perfectly reconstruct images. One powerful Euclidean latent space embedded in ISTRC is FaceNet's last layer with the power of distinguishing and understanding images. Our model will reconstruct the images and manipulate Euclidean latent vectors to achieve semantic transformations and semantic images arthimetic calculations. In this paper, we show that ISTRC performs 10 high-level semantic transformations like "Male and female","add smile","open mouth", "deduct beard or add mustache", "bigger/smaller nose", "make older and younger", "bigger lips", "bigger eyes", "bigger/smaller mouths" and "more attractive". It just takes 3 hours(GTX 1080) to train the models of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Digital Media Forensic Detection