3D Morphable Models as Spatial Transformer Networks

Anil Bas; Patrik Huber; William A. P. Smith; Muhammad Awais; Josef Kittler

arXiv:1708.07199·cs.CV·April 20, 2018

TL;DR

This paper introduces a neural network module (a 3DMM-STN) that uses a 3D Morphable Model inside a spatial transformer to normalize 3D pose changes and self-occlusion in images, improving robustness in face analysis tasks.

Contribution

It extends spatial transformer networks by incorporating 3D Morphable Models, enabling explicit 3D pose normalization and occlusion handling within CNNs.

Findings

01

Localiser learns to fit 3DMM to images with simple geometric loss

02

Robust normalization on images with occlusion and pose variation

03

Effective training on small datasets

Abstract

In this paper, we show how a 3D Morphable Model (i.e. a statistical model of the 3D shape of a class of objects such as faces) can be used to spatially transform input data as a module (a 3DMM-STN) within a convolutional neural network. This is an extension of the original spatial transformer network in that we are able to interpret and normalise 3D pose changes and self-occlusions. The trained localisation part of the network is independently useful since it learns to fit a 3D morphable model to a single image. We show that the localiser can be trained using only simple geometric loss functions on a relatively small dataset yet is able to perform robust normalisation on highly uncontrolled images including occlusion, self-occlusion and large pose changes.
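The abstract describes a pipeline in which a localiser predicts 3DMM parameters and the model is then used to resample the input image. The following NumPy sketch illustrates that idea under stated assumptions and is not the authors' implementation: this page does not specify the camera model or pose parameterisation, so the scaled orthographic projection, the axis-angle rotation, and all function names here are hypothetical choices made for illustration.

```python
import numpy as np

def morphable_shape(mean_shape, basis, alpha):
    """Linear 3DMM: mean shape plus a weighted sum of basis shapes.
    mean_shape: (3N,), basis: (3N, K), alpha: (K,). Returns (N, 3) vertices."""
    return (mean_shape + basis @ alpha).reshape(-1, 3)

def rotation_from_axis_angle(r):
    """Rodrigues' formula: axis-angle vector (3,) to a 3x3 rotation matrix."""
    theta = np.linalg.norm(r)
    if theta < 1e-12:
        return np.eye(3)
    k = r / theta
    K = np.array([[0.0, -k[2], k[1]],
                  [k[2], 0.0, -k[0]],
                  [-k[1], k[0], 0.0]])
    return np.eye(3) + np.sin(theta) * K + (1.0 - np.cos(theta)) * (K @ K)

def project(vertices, r, t, scale):
    """Assumed camera: scaled orthographic projection of (N, 3) vertices
    to (N, 2) image points (x, y)."""
    rotated = vertices @ rotation_from_axis_angle(r).T
    return scale * rotated[:, :2] + t

def bilinear_sample(image, points):
    """Bilinearly sample a grayscale image (H, W) at (x, y) points,
    clamping coordinates to the image border."""
    h, w = image.shape
    x = np.clip(points[:, 0], 0, w - 1)
    y = np.clip(points[:, 1], 0, h - 1)
    x0 = np.floor(x).astype(int)
    y0 = np.floor(y).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    fx, fy = x - x0, y - y0
    top = (1 - fx) * image[y0, x0] + fx * image[y0, x1]
    bottom = (1 - fx) * image[y1, x0] + fx * image[y1, x1]
    return (1 - fy) * top + fy * bottom
```

In a trained network the shape coefficients and pose would come from the localiser rather than being set by hand. Every step in this sketch is differentiable almost everywhere with respect to those parameters, which is the property a spatial transformer relies on for end-to-end training.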

Code & Models

Repositories

anilbas/3DMMasSTN
Official

Taxonomy

Methods: Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Spatial Transformer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · Adam