Learning to Minimize the Remainder in Supervised Learning
Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

TL;DR
This paper introduces Gradient Adjustment Learning (GAL), a novel method that leverages past training information to minimize approximation remainders, improving model performance across multiple multimedia tasks.
Contribution
The paper proposes GAL, a model- and optimizer-agnostic approach that adjusts gradients to reduce approximation errors during training, enhancing deep learning models.
Findings
GAL improves accuracy in image classification, object detection, and regression.
The method is compatible with various models and optimizers.
Experimental results outperform baseline methods.
Abstract
The learning process of deep learning methods usually updates the model's parameters in multiple iterations. Each iteration can be viewed as the first-order approximation of Taylor's series expansion. The remainder, which consists of higher-order terms, is usually ignored in the learning process for simplicity. This learning scheme empowers various multimedia based applications, such as image retrieval, recommendation system, and video search. Generally, multimedia data (e.g., images) are semantics-rich and high-dimensional, hence the remainders of approximations are possibly non-zero. In this work, we consider the remainder to be informative and study how it affects the learning process. To this end, we propose a new learning approach, namely gradient adjustment learning (GAL), to leverage the knowledge learned from the past training iterations to adjust vanilla gradients, such that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
