Learning Compact Features via In-Training Representation Alignment

Xin Li; Xiangrui Li; Deng Pan; Yao Qiang; and Dongxiao Zhu

arXiv:2211.13332·cs.LG·November 28, 2022

Learning Compact Features via In-Training Representation Alignment

Xin Li, Xiangrui Li, Deng Pan, Yao Qiang, and Dongxiao Zhu

PDF

Open Access 1 Video

TL;DR

This paper introduces In-Training Representation Alignment (ITRA), a method that explicitly aligns feature distributions during training to produce compact, robust features, reducing variance and improving classification performance.

Contribution

ITRA is a novel training technique that explicitly aligns feature distributions across mini-batches to enhance feature compactness and stability during neural network training.

Findings

01

ITRA improves feature compactness and reduces overfitting.

02

ITRA achieves superior performance on image and text classification tasks.

03

ITRA demonstrates robustness across different modalities.

Abstract

Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of the feature extractor (i.e., last hidden layer) and a linear classifier (i.e., output layer) that are trained jointly with stochastic gradient descent (SGD) on the loss function (e.g., cross-entropy). In each epoch, the true gradient of the loss function is estimated using a mini-batch sampled from the training set and model parameters are then updated with the mini-batch gradients. Although the latter provides an unbiased estimation of the former, they are subject to substantial variances derived from the size and number of sampled mini-batches, leading to noisy and jumpy updates. To stabilize such undesirable variance in estimating the true gradients, we propose In-Training Representation Alignment (ITRA) that explicitly aligns feature distributions of two different mini-batches with a matching loss in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning Compact Features via In-Training Representation Alignment· underline

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Advanced Neural Network Applications

MethodsStochastic Gradient Descent