Graph-based Isometry Invariant Representation Learning

Renata Khasanova; Pascal Frossard

arXiv:1703.00356·cs.CV·March 2, 2017·34 cites

TL;DR

This paper introduces TIGraNet, a graph-based neural network that learns features inherently invariant to isometric transformations like rotation and translation, improving classification robustness on transformed images.

Contribution

The paper proposes a graph-based network architecture that replaces classical convolution and pooling layers with graph spectral convolution and dynamic graph pooling layers, which together provide invariance to isometric transformations.

Findings

01. High accuracy on rotated and translated images

02. Enhanced robustness to geometric transformations

03. Maintains performance with limited training data

Abstract

Learning transformation-invariant representations of visual data is an important problem in computer vision. Deep convolutional networks have demonstrated remarkable results for image and video classification tasks. However, they have achieved only limited success in classifying images that undergo geometric transformations. In this work we present a novel Transformation Invariant Graph-based Network (TIGraNet), which learns graph-based features that are inherently invariant to isometric transformations such as rotation and translation of input images. In particular, images are represented as signals on graphs, which makes it possible to replace the classical convolution and pooling layers of deep networks with graph spectral convolution and dynamic graph pooling layers that together contribute to invariance to isometric transformations. Our experiments show high performance on rotated and…
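To make the idea of treating an image as a signal on a graph more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a 4-neighbour grid graph and a filter that is a polynomial in the graph Laplacian, which is one common form of graph spectral convolution. The names `grid_laplacian` and `poly_filter` and the coefficients are hypothetical. The sketch checks that filtering a rotated image gives the rotated filter output (equivariance), and that a global statistic of the output is therefore unchanged. Only 90-degree rotations are exact symmetries of a finite grid with borders, so translations are not covered here.

```python
import numpy as np


def grid_laplacian(n):
    """Combinatorial Laplacian L = D - A of an n x n 4-neighbour grid graph."""
    num_nodes = n * n
    A = np.zeros((num_nodes, num_nodes))
    for r in range(n):
        for c in range(n):
            i = r * n + c
            if c + 1 < n:  # edge to the right-hand neighbour
                A[i, i + 1] = A[i + 1, i] = 1.0
            if r + 1 < n:  # edge to the neighbour below
                A[i, i + n] = A[i + n, i] = 1.0
    return np.diag(A.sum(axis=1)) - A


def poly_filter(L, x, coeffs):
    """Apply y = sum_k coeffs[k] * L^k x without forming powers of L."""
    y = np.zeros_like(x)
    Lk_x = x.copy()
    for a in coeffs:
        y += a * Lk_x
        Lk_x = L @ Lk_x
    return y


n = 6
rng = np.random.default_rng(0)
img = rng.random((n, n))          # toy image, one pixel per graph node
L = grid_laplacian(n)
coeffs = [0.5, -0.3, 0.1]         # hypothetical filter coefficients

out = poly_filter(L, img.ravel(), coeffs).reshape(n, n)
out_rot = poly_filter(L, np.rot90(img).ravel(), coeffs).reshape(n, n)

# Filtering commutes with the rotation because the rotation maps the grid
# graph onto itself, so the Laplacian is unchanged by it.
equivariant = np.allclose(out_rot, np.rot90(out))

# A global statistic of the filtered signal is then rotation invariant.
invariant_stat = np.isclose(out.sum(), out_rot.sum())

print(equivariant, invariant_stat)
```

The design point is that the filter depends only on the graph structure, not on pixel coordinates, so any transformation that preserves the graph leaves the filter response unchanged up to a reordering of nodes.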

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition · Multimodal Machine Learning Applications