Transform Invariant Auto-encoder

Tadashi Matsuo; Hiroya Fukuhara; Nobutaka Shimada

arXiv:1709.03754·cs.CV·September 13, 2017

Transform Invariant Auto-encoder

Tadashi Matsuo, Hiroya Fukuhara, Nobutaka Shimada

PDF

Open Access

TL;DR

The paper introduces a transform invariant auto-encoder that separates spatial transform parameters from descriptors, enabling shift-invariant feature extraction and transform parameter inference without labeled data.

Contribution

It proposes a novel auto-encoder framework that achieves invariance to spatial shifts and infers transform parameters, applicable to various auto-encoders without additional modules or supervision.

Findings

01

Achieves shift-invariant descriptors for spatial subpatterns

02

Can infer shift parameters to restore inputs

03

Demonstrates application in robot hand imitation

Abstract

The auto-encoder method is a type of dimensionality reduction method. A mapping from a vector to a descriptor that represents essential information can be automatically generated from a set of vectors without any supervising information. However, an image and its spatially shifted version are encoded into different descriptors by an existing ordinary auto-encoder because each descriptor includes a spatial subpattern and its position. To generate a descriptor representing a spatial subpattern in an image, we need to normalize its spatial position in the images prior to training an ordinary auto-encoder; however, such a normalization is generally difficult for images without obvious standard positions. We propose a transform invariant auto-encoder and an inference model of transform parameters. By the proposed method, we can separate an input into a transform invariant descriptor and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition