# BDS-Adam optimizer integrating adaptive variance rectification with semi-adaptive gradient smoothing

**Authors:** Yichuan Shao, Shiqian Weng, Haijing Sun, Qian Gao, Le Zhang, Zhiqiang Mao, Shuai Xu, Zhitao Zhang, Lei Xing

PMC · DOI: 10.1038/s41598-025-20788-y · 2025-10-22

## TL;DR

This paper introduces BDS-Adam, an improved Adam optimizer that enhances training stability and accuracy by combining adaptive gradient reshaping and smoothing.

## Contribution

The novel dual-path framework integrates adaptive gradient reshaping and semi-adaptive smoothing for improved optimization.

## Key findings

- BDS-Adam achieves 9.27% higher test accuracy on CIFAR-10 compared to Adam.
- The optimizer shows improved stability and convergence in non-convex settings.
- Test accuracy on a gastric pathology image dataset improves by 3.00%.

## Abstract

In this work, an enhanced variant of the Adam optimizer, termed BDS-Adam, is proposed to address two critical limitations of the original Adam algorithm: biased gradient estimation and training instability during early optimization. To overcome these issues, a dual-path framework is adopted. In the first path, a nonlinear gradient mapping module (adaptive reshaping of raw gradients using hyperbolic tangent) is applied to adaptively reshape raw gradients, enabling the optimizer to better capture local geometric structures. In the second path, a semi-adaptive gradient smoothing controller–based on real-time gradient variance–is incorporated to suppress abrupt parameter updates and stabilize training dynamics. These two outputs are integrated through a gradient fusion mechanism (combining smoothed and transformed gradients before updates), in which smoothed and transformed gradients are combined prior to parameter updates. Moreover, an adaptive second-order moment correction technique is employed to mitigate cold-start effects caused by inaccurate variance estimates in the early training phase. A convergence analysis under non-convex settings is provided, and it is theoretically demonstrated that the expected gradient norm is bounded under standard assumptions, indicating improved robustness and long-term stability. This adaptive bias-correction formulation further improves training stability. Empirical evaluations on three benchmark datasets–CIFAR-10, MNIST, and a gastric pathology image dataset–reveal test accuracy improvements of 9.27%, 0.08%, and 3.00%, respectively, compared to Adam. These results confirm that the proposed dual-mechanism optimizer effectively enhances both convergence speed and generalization performance across diverse tasks.

## Full-text entities

- **Diseases:** gastric (MESH:D013272), hemorrhoids (MESH:D006484), polyps (MESH:D011127), ulcers (MESH:D014456), gastric cancer (MESH:D013274)
- **Cell lines:** -10 — Mus musculus (Mouse), Hybridoma (CVCL_C4R4)

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12546589/full.md

---
Source: https://tomesphere.com/paper/PMC12546589