Reducing Bias in Deep Learning Optimization: The RSGDM Approach

Honglin Qin; Hongye Zheng; Bingxing Wang; Zhizhong Wu; Bingyao Liu,; Yuanfang Yang

arXiv:2409.15314·cs.LG·September 25, 2024

Reducing Bias in Deep Learning Optimization: The RSGDM Approach

Honglin Qin, Hongye Zheng, Bingxing Wang, Zhizhong Wu, Bingyao Liu,, Yuanfang Yang

PDF

Open Access

TL;DR

This paper introduces RSGDM, a new optimization algorithm for deep learning that corrects bias and lag in gradient estimation, leading to improved convergence accuracy over traditional SGDM.

Contribution

The paper presents a novel RSGDM algorithm that corrects bias and lag in gradient estimates of SGDM using differential correction, enhancing optimization performance.

Findings

01

RSGDM outperforms SGDM in convergence accuracy on CIFAR datasets.

02

Bias and lag in exponential moving averages negatively impact optimization.

03

Differential correction effectively reduces bias and lag in gradient estimation.

Abstract

Currently, widely used first-order deep learning optimizers include non-adaptive learning rate optimizers and adaptive learning rate optimizers. The former is represented by SGDM (Stochastic Gradient Descent with Momentum), while the latter is represented by Adam. Both of these methods use exponential moving averages to estimate the overall gradient. However, estimating the overall gradient using exponential moving averages is biased and has a lag. This paper proposes an RSGDM algorithm based on differential correction. Our contributions are mainly threefold: 1) Analyze the bias and lag brought by the exponential moving average in the SGDM algorithm. 2) Use the differential estimation term to correct the bias and lag in the SGDM algorithm, proposing the RSGDM algorithm. 3) Experiments on the CIFAR datasets have proven that our RSGDM algorithm is superior to the SGDM algorithm in terms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBIM and Construction Integration