TransformNet: Self-supervised representation learning through predicting   geometric transformations

Sayed Hashim; Muhammad Ali

arXiv:2202.04181·cs.CV·February 10, 2022

TransformNet: Self-supervised representation learning through predicting geometric transformations

Sayed Hashim, Muhammad Ali

PDF

Open Access 1 Repo

TL;DR

TransformNet introduces a self-supervised learning approach that trains neural networks to recognize geometric transformations, improving feature learning from limited data without labeled annotations.

Contribution

It proposes a novel self-supervised scheme based on predicting geometric transformations, extending beyond rotations to other transformations, and evaluates its effectiveness on image classification tasks.

Findings

01

Effective in learning high-level features from limited data

02

Improves downstream classification accuracy

03

Versatile across different transformations and models

Abstract

Deep neural networks need a big amount of training data, while in the real world there is a scarcity of data available for training purposes. To resolve this issue unsupervised methods are used for training with limited data. In this report, we describe the unsupervised semantic feature learning approach for recognition of the geometric transformation applied to the input data. The basic concept of our approach is that if someone is unaware of the objects in the images, he/she would not be able to quantitatively predict the geometric transformation that was applied to them. This self supervised scheme is based on pretext task and the downstream task. The pretext classification task to quantify the geometric transformations should force the CNN to learn high-level salient features of objects useful for image classification. In our baseline model, we define image rotations by multiples of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hashimsayed0/transformnet
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing and 3D Reconstruction · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques

MethodsConcatenated Skip Connection · Residual Connection · Dense Block · Average Pooling · Softmax · *Communicated@Fast*How Do I Communicate to Expedia? · Residual Block · Max Pooling · Batch Normalization · Dropout