Transform Invariant Auto-encoder
Tadashi Matsuo, Hiroya Fukuhara, Nobutaka Shimada

TL;DR
The paper introduces a transform invariant auto-encoder that separates spatial transform parameters from descriptors, enabling shift-invariant feature extraction and transform parameter inference without labeled data.
Contribution
It proposes a novel auto-encoder framework that achieves invariance to spatial shifts and infers transform parameters, applicable to various auto-encoders without additional modules or supervision.
Findings
Achieves shift-invariant descriptors for spatial subpatterns
Can infer shift parameters to restore inputs
Demonstrates application in robot hand imitation
Abstract
The auto-encoder method is a type of dimensionality reduction method. A mapping from a vector to a descriptor that represents essential information can be automatically generated from a set of vectors without any supervising information. However, an image and its spatially shifted version are encoded into different descriptors by an existing ordinary auto-encoder because each descriptor includes a spatial subpattern and its position. To generate a descriptor representing a spatial subpattern in an image, we need to normalize its spatial position in the images prior to training an ordinary auto-encoder; however, such a normalization is generally difficult for images without obvious standard positions. We propose a transform invariant auto-encoder and an inference model of transform parameters. By the proposed method, we can separate an input into a transform invariant descriptor and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
