ZENITH: Automated Gradient Norm Informed Stochastic Optimization

Dhrubo Saha

arXiv:2601.15212·cs.LG·January 22, 2026

ZENITH: Automated Gradient Norm Informed Stochastic Optimization

Dhrubo Saha

PDF

Open Access

TL;DR

ZENITH is a novel optimizer that adaptively adjusts learning rates based on gradient norm evolution, leading to improved accuracy and efficiency in computer vision tasks without additional overhead.

Contribution

The paper introduces ZENITH, a gradient norm-informed optimizer that automatically adapts learning rates, outperforming existing methods in accuracy, efficiency, and compatibility with regularization.

Findings

01

Higher test accuracy than baselines across CNN architectures.

02

Faster training times in image classification tasks.

03

Superior mAP in object detection, keypoint detection, and segmentation.

Abstract

Training deep computer vision models requires manual oversight or hyperparameter tuning of the learning rate (LR) schedule. While existing adaptive optimizers schedule the LR automatically, they suffer from computational and memory overhead, incompatibility with regularization, and suboptimal LR choices. In this work, we introduce the ZENITH (Zero-overhead Evolution using Norm-Informed Training History) optimizer, which adapts the LR using the temporal evolution of the gradient norm. Image classification experiments spanning 6 CNN architectures and 6 benchmarks demonstrate that ZENITH achieves higher test accuracy in lower wall-clock time than baselines. It also yielded superior mAP in object detection, keypoint detection, and instance segmentation on MS COCO using the R-CNN family of models. Furthermore, its compatibility with regularization enables even better generalization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Human Pose and Action Recognition