Learning Compact Features via In-Training Representation Alignment
Xin Li, Xiangrui Li, Deng Pan, Yao Qiang, and Dongxiao Zhu

TL;DR
This paper introduces In-Training Representation Alignment (ITRA), a method that explicitly aligns feature distributions during training to produce compact, robust features, reducing variance and improving classification performance.
Contribution
ITRA is a novel training technique that explicitly aligns feature distributions across mini-batches to enhance feature compactness and stability during neural network training.
Findings
ITRA improves feature compactness and reduces overfitting.
ITRA achieves superior performance on image and text classification tasks.
ITRA demonstrates robustness across different modalities.
Abstract
Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of the feature extractor (i.e., last hidden layer) and a linear classifier (i.e., output layer) that are trained jointly with stochastic gradient descent (SGD) on the loss function (e.g., cross-entropy). In each epoch, the true gradient of the loss function is estimated using a mini-batch sampled from the training set and model parameters are then updated with the mini-batch gradients. Although the latter provides an unbiased estimation of the former, they are subject to substantial variances derived from the size and number of sampled mini-batches, leading to noisy and jumpy updates. To stabilize such undesirable variance in estimating the true gradients, we propose In-Training Representation Alignment (ITRA) that explicitly aligns feature distributions of two different mini-batches with a matching loss in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Advanced Neural Network Applications
MethodsStochastic Gradient Descent
