BDS-Adam optimizer integrating adaptive variance rectification with semi-adaptive gradient smoothing
Yichuan Shao, Shiqian Weng, Haijing Sun, Qian Gao, Le Zhang, Zhiqiang Mao, Shuai Xu, Zhitao Zhang, Lei Xing

TL;DR
This paper introduces BDS-Adam, an improved Adam optimizer that enhances training stability and accuracy by combining adaptive gradient reshaping and smoothing.
Contribution
The novel dual-path framework integrates adaptive gradient reshaping and semi-adaptive smoothing for improved optimization.
Findings
BDS-Adam achieves 9.27% higher test accuracy on CIFAR-10 compared to Adam.
The optimizer shows improved stability and convergence in non-convex settings.
Test accuracy on a gastric pathology image dataset improves by 3.00%.
Abstract
In this work, an enhanced variant of the Adam optimizer, termed BDS-Adam, is proposed to address two critical limitations of the original Adam algorithm: biased gradient estimation and training instability during early optimization. To overcome these issues, a dual-path framework is adopted. In the first path, a nonlinear gradient mapping module (adaptive reshaping of raw gradients using hyperbolic tangent) is applied to adaptively reshape raw gradients, enabling the optimizer to better capture local geometric structures. In the second path, a semi-adaptive gradient smoothing controller–based on real-time gradient variance–is incorporated to suppress abrupt parameter updates and stabilize training dynamics. These two outputs are integrated through a gradient fusion mechanism (combining smoothed and transformed gradients before updates), in which smoothed and transformed gradients are…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research · Stochastic Gradient Optimization Techniques · Fractional Differential Equations Solutions
