Revisiting Scalable Hessian Diagonal Approximations for Applications in   Reinforcement Learning

Mohamed Elsayed; Homayoon Farrahi; Felix Dangel; A. Rupam Mahmood

arXiv:2406.03276·cs.LG·July 8, 2024·1 cites

Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning

Mohamed Elsayed, Homayoon Farrahi, Felix Dangel, A. Rupam Mahmood

PDF

Open Access 1 Repo

TL;DR

This paper revisits and improves a computationally efficient Hessian diagonal approximation method, HesScale, demonstrating its effectiveness in reinforcement learning and second-order optimization for small networks.

Contribution

HesScale is an improved Hessian diagonal approximation method that is computationally cheap and outperforms existing approaches in small network reinforcement learning tasks.

Findings

01

HesScale provides higher quality Hessian diagonal estimates than alternatives.

02

HesScale accelerates optimization and enhances stability in reinforcement learning.

03

The method shows promise for scaling to larger models in the future.

Abstract

Second-order information is valuable for many applications but challenging to compute. Several works focus on computing or approximating Hessian diagonals, but even this simplification introduces significant additional costs compared to computing a gradient. In the absence of efficient exact computation schemes for Hessian diagonals, we revisit an early approximation scheme proposed by Becker and LeCun (1989, BL89), which has a cost similar to gradients and appears to have been overlooked by the community. We introduce HesScale, an improvement over BL89, which adds negligible extra computation. On small networks, we find that this improvement is of higher quality than all alternatives, even those with theoretical guarantees, such as unbiasedness, while being much cheaper to compute. We use this insight in reinforcement learning problems where small networks are used and demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mohmdelsayed/hesscale
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Stochastic Gradient Optimization Techniques

MethodsFocus