Learning Canonical Transformations

Zachary Dulberg; Jonathan Cohen

arXiv:2011.08822·cs.CV·November 18, 2020

Open Access

TL;DR

This paper investigates how neural networks can learn canonical geometric transformations like translation and rotation in pixel space, emphasizing the importance of training diversity and iterative training for out-of-domain generalization.

Contribution

The paper demonstrates that high training set diversity enables neural networks to extrapolate translation to unseen shapes and scales, and that an iterative training scheme improves the extrapolation of rotation in time.

Findings

01. High training set diversity is sufficient for extrapolating translation to unseen shapes and scales.

02. An iterative training scheme achieves significant extrapolation of rotation in time.

03. Neural networks can learn canonical transformations in pixel space.

Abstract

Humans understand a set of canonical geometric transformations (such as translation and rotation) that support generalization by being untethered to any specific object. We explore inductive biases that help a neural network model learn these transformations in pixel space in a way that can generalize out-of-domain. Specifically, we find that high training set diversity is sufficient for the extrapolation of translation to unseen shapes and scales, and that an iterative training scheme achieves significant extrapolation of rotation in time.
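
To make the idea of an object-independent transformation in pixel space concrete, the following is a minimal sketch, not the authors' model or training code. It uses NumPy to define a ground-truth one-pixel translation and applies it repeatedly, which illustrates how a single-step transformation composes over time. The function names `translate` and `iterate`, the wrap-around boundary behavior, and the random binary images standing in for a "diverse" training set are all assumptions made for illustration.

```python
import numpy as np

def translate(image, dx, dy):
    """Shift a 2-D image by (dx, dy) pixels.

    Wrap-around at the borders (np.roll) is an assumption of this sketch,
    not something stated in the paper.
    """
    return np.roll(image, shift=(dy, dx), axis=(0, 1))

def iterate(step, image, n_steps):
    """Apply a one-step transformation n_steps times and keep every frame."""
    frames = [image]
    for _ in range(n_steps):
        frames.append(step(frames[-1]))
    return frames

rng = np.random.default_rng(0)
# Hypothetical "diverse" inputs: random binary images rather than one fixed shape.
images = (rng.random((4, 8, 8)) > 0.5).astype(np.float32)

# The same one-step rule is applied regardless of image content.
one_step = lambda img: translate(img, dx=1, dy=0)
frames = iterate(one_step, images[0], n_steps=5)

# Five one-pixel steps compose into a single five-pixel shift.
assert np.array_equal(frames[-1], translate(images[0], dx=5, dy=0))
```

In a learning setting, pairs such as `(frames[t], frames[t + 1])` could serve as input and target for a one-step model, and unrolling that model for more steps than it saw during training would be one way to probe extrapolation in time. That usage is a suggestion under the assumptions above and is not described in the abstract.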

Taxonomy

Topics: Morphological variations and asymmetry · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction