Information Hidden in Gradients of Regression with Target Noise
Arash Jamshidi, Katsiaryna Haitsiukevich, Kai Puolam\"aki

TL;DR
This paper demonstrates that in linear regression, gradients alone can reveal the data covariance by injecting Gaussian noise to calibrate target noise variance, enabling better optimization and analysis.
Contribution
It introduces a simple variance calibration method that allows gradient covariance to approximate the Hessian, even far from the optimum, with theoretical guarantees.
Findings
Gradient covariance equals data covariance under target noise calibration.
Proper noise calibration ensures accurate Hessian recovery.
Method is practical, robust, and applicable to various optimization tasks.
Abstract
Second-order information -- such as curvature or data covariance -- is critical for optimisation, diagnostics, and robustness. However, in many modern settings, only the gradients are observable. We show that the gradients alone can reveal the Hessian, equalling the data covariance for the linear regression. Our key insight is a simple variance calibration: injecting Gaussian noise so that the total target noise variance equals the batch size ensures that the empirical gradient covariance closely approximates the Hessian, even when evaluated far from the optimum. We provide non-asymptotic operator-norm guarantees under sub-Gaussian inputs. We also show that without such calibration, recovery can fail by an factor. The proposed method is practical (a "set target-noise variance to " rule) and robust (variance suffices to recover up to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
