NOVAK: Unified adaptive optimizer for deep neural networks

Sergii Kavun

arXiv:2601.07876·cs.LG·January 14, 2026

NOVAK: Unified adaptive optimizer for deep neural networks

Sergii Kavun

PDF

Open Access

TL;DR

NOVAK is a comprehensive adaptive optimizer for deep neural networks that combines multiple advanced techniques, offering improved speed, stability, and accuracy across various architectures and datasets.

Contribution

It introduces NOVAK, a unified, modular optimizer integrating adaptive methods, rectified learning rates, and lookahead synchronization with theoretical guarantees and practical speedups.

Findings

01

Outperforms 14 state-of-the-art optimizers on multiple datasets.

02

Achieves high accuracy and robustness on various architectures.

03

Provides theoretical convergence and stability analysis.

Abstract

This work introduces NOVAK, a modular gradient-based optimization algorithm that integrates adaptive moment estimation, rectified learning-rate scheduling, decoupled weight regularization, multiple variants of Nesterov momentum, and lookahead synchronization into a unified, performance-oriented framework. NOVAK adopts a dual-mode architecture consisting of a streamlined fast path designed for production. The optimizer employs custom CUDA kernels that deliver substantial speedups (3-5 for critical operations) while preserving numerical stability under standard stochastic-optimization assumptions. We provide fully developed mathematical formulations for rectified adaptive learning rates, a memory-efficient lookahead mechanism that reduces overhead from O(2p) to O(p + p/k), and the synergistic coupling of complementary optimization components. Theoretical analysis establishes convergence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Metaheuristic Optimization Algorithms Research