NOVAK: Unified adaptive optimizer for deep neural networks
Sergii Kavun

TL;DR
NOVAK is a comprehensive adaptive optimizer for deep neural networks that combines multiple advanced techniques, offering improved speed, stability, and accuracy across various architectures and datasets.
Contribution
It introduces NOVAK, a unified, modular optimizer integrating adaptive methods, rectified learning rates, and lookahead synchronization with theoretical guarantees and practical speedups.
Findings
Outperforms 14 state-of-the-art optimizers on multiple datasets.
Achieves high accuracy and robustness on various architectures.
Provides theoretical convergence and stability analysis.
Abstract
This work introduces NOVAK, a modular gradient-based optimization algorithm that integrates adaptive moment estimation, rectified learning-rate scheduling, decoupled weight regularization, multiple variants of Nesterov momentum, and lookahead synchronization into a unified, performance-oriented framework. NOVAK adopts a dual-mode architecture consisting of a streamlined fast path designed for production. The optimizer employs custom CUDA kernels that deliver substantial speedups (3-5 for critical operations) while preserving numerical stability under standard stochastic-optimization assumptions. We provide fully developed mathematical formulations for rectified adaptive learning rates, a memory-efficient lookahead mechanism that reduces overhead from O(2p) to O(p + p/k), and the synergistic coupling of complementary optimization components. Theoretical analysis establishes convergence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Metaheuristic Optimization Algorithms Research
