ZetA: A Riemann Zeta-Scaled Extension of Adam for Deep Learning

Samiksha BC

arXiv:2508.02719·cs.LG·August 6, 2025

ZetA: A Riemann Zeta-Scaled Extension of Adam for Deep Learning

Samiksha BC

PDF

TL;DR

ZetA introduces a novel optimizer for deep learning that leverages the Riemann zeta function for dynamic gradient scaling, enhancing generalization and robustness over Adam in various image classification tasks.

Contribution

It is the first optimizer to incorporate zeta-based gradient scaling, combining multiple techniques to improve deep learning optimization.

Findings

01

Consistent accuracy improvements over Adam on multiple datasets.

02

Effective in noisy and high-granularity classification tasks.

03

Computationally efficient and robust in experiments.

Abstract

This work introduces ZetA, a novel deep learning optimizer that extends Adam by incorporating dynamic scaling based on the Riemann zeta function. To the best of our knowledge, ZetA is the first optimizer to apply zeta-based gradient scaling within deep learning optimization. The method improves generalization and robustness through a hybrid update mechanism that integrates adaptive damping, cosine similarity-based momentum boosting, entropy-regularized loss, and Sharpness-Aware Minimization (SAM)-style perturbations. Empirical evaluations on SVHN, CIFAR10, CIFAR100, STL10, and noisy CIFAR10 consistently show test accuracy improvements over Adam. All experiments employ a lightweight fully connected network trained for five epochs under mixed-precision settings. The results demonstrate that ZetA is a computationally efficient and robust alternative to Adam, particularly effective in noisy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.