Arc Gradient Descent: A Geometrically Motivated Gradient Descent-based Optimiser with Phase-Aware, User-Controlled Step Dynamics (proof-of-concept)
Nikhil Verma, Joonas Linnosmaa, Leonardo Espinosa-Leal, Napat Vajragupta

TL;DR
This paper introduces ArcGD, a geometrically motivated gradient descent optimizer that demonstrates superior performance and generalization on challenging non-convex functions and real-world image classification tasks, outperforming existing optimizers.
Contribution
The paper presents ArcGD, a novel phase-aware, user-controlled optimizer with unique step dynamics, and provides empirical evidence of its effectiveness across diverse benchmarks and deep learning architectures.
Findings
ArcGD outperforms Adam and other optimizers on non-convex functions.
ArcGD achieves higher accuracy on CIFAR-10 across multiple architectures.
ArcGD shows resistance to overfitting and maintains improvement with extended training.
Abstract
The paper presents the formulation, implementation, and evaluation of the ArcGD optimiser. The evaluation is conducted initially on a non-convex benchmark function and subsequently on a real-world ML dataset. The initial comparative study using the Adam optimiser is conducted on a stochastic variant of the highly non-convex and notoriously challenging Rosenbrock function, renowned for its narrow, curved valley, across dimensions ranging from 2D to 1000D and an extreme case of 50,000D. Two configurations were evaluated to eliminate learning-rate bias: (i) both using ArcGD's effective learning rate and (ii) both using Adam's default learning rate. ArcGD consistently outperformed Adam under the first setting and, although slower under the second, achieved superior final solutions in most cases. In the second evaluation, ArcGD is evaluated against state-of-the-art optimizers (Adam, AdamW,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Optical Polarization and Ellipsometry · Stochastic Gradient Optimization Techniques
