TransformNet: Self-supervised representation learning through predicting geometric transformations
Sayed Hashim, Muhammad Ali

TL;DR
TransformNet introduces a self-supervised learning approach that trains neural networks to recognize geometric transformations, improving feature learning from limited data without labeled annotations.
Contribution
It proposes a novel self-supervised scheme based on predicting geometric transformations, extending beyond rotations to other transformations, and evaluates its effectiveness on image classification tasks.
Findings
Effective in learning high-level features from limited data
Improves downstream classification accuracy
Versatile across different transformations and models
Abstract
Deep neural networks need a big amount of training data, while in the real world there is a scarcity of data available for training purposes. To resolve this issue unsupervised methods are used for training with limited data. In this report, we describe the unsupervised semantic feature learning approach for recognition of the geometric transformation applied to the input data. The basic concept of our approach is that if someone is unaware of the objects in the images, he/she would not be able to quantitatively predict the geometric transformation that was applied to them. This self supervised scheme is based on pretext task and the downstream task. The pretext classification task to quantify the geometric transformations should force the CNN to learn high-level salient features of objects useful for image classification. In our baseline model, we define image rotations by multiples of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsConcatenated Skip Connection · Residual Connection · Dense Block · Average Pooling · Softmax · *Communicated@Fast*How Do I Communicate to Expedia? · Residual Block · Max Pooling · Batch Normalization · Dropout
