Hindsight-Guided Momentum (HGM) Optimizer: An Approach to Adaptive Learning Rate
Krisanu Sarkar

TL;DR
Hindsight-Guided Momentum (HGM) is a novel optimizer that adaptively adjusts learning rates based on directional consistency, improving convergence speed and stability in deep neural network training.
Contribution
HGM introduces a hindsight mechanism that uses cosine similarity to adaptively scale learning rates, enhancing responsiveness to the optimization landscape's geometry.
Findings
Accelerates convergence in smooth regions.
Maintains stability in sharp or noisy regions.
Preserves computational efficiency.
Abstract
We introduce Hindsight-Guided Momentum (HGM), a first-order optimization algorithm that adaptively scales learning rates based on the directional consistency of recent updates. Traditional adaptive methods, such as Adam or RMSprop , adapt learning dynamics using only the magnitude of gradients, often overlooking important geometric cues.Geometric cues refer to directional information, such as the alignment between current gradients and past updates, which reflects the local curvature and consistency of the optimization path. HGM addresses this by incorporating a hindsight mechanism that evaluates the cosine similarity between the current gradient and accumulated momentum. This allows it to distinguish between coherent and conflicting gradient directions, increasing the learning rate when updates align and reducing it in regions of oscillation or noise. The result is a more responsive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research · Neural Networks and Reservoir Computing
MethodsRMSProp · ALIGN
