TL;DR
This paper explores the geometric structure of nonlinear least squares models, especially sloppy models, and introduces methods to improve optimization by understanding the model manifold's geometry and using geodesic coordinates.
Contribution
It characterizes the universal geometric features of sloppy models, introduces the model graph to address boundary issues, and proposes geodesic-based coordinates and acceleration to enhance optimization algorithms.
Findings
Model manifolds have universal geometric features like width series and curvatures.
Boundaries of the model manifold cause parameter divergence; the model graph helps mitigate this.
Geodesic coordinates improve the efficiency and success rate of fitting algorithms.
Abstract
Parameter estimation by nonlinear least squares minimization is a common problem with an elegant geometric interpretation: the possible parameter values of a model induce a manifold in the space of data predictions. The minimization problem is then to find the point on the manifold closest to the data. We show that the model manifolds of a large class of models, known as sloppy models, have many universal features; they are characterized by a geometric series of widths, extrinsic curvatures, and parameter-effects curvatures. A number of common difficulties in optimizing least squares problems are due to this common structure. First, algorithms tend to run into the boundaries of the model manifold, causing parameters to diverge or become unphysical. We introduce the model graph as an extension of the model manifold to remedy this problem. We argue that appropriate priors can remove the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
