Solving hybrid machine learning tasks by traversing weight space geodesics
Guruprasad Raghavan, Matt Thomson

TL;DR
This paper introduces a geometric framework based on Riemannian manifolds and geodesic paths in neural network weight space to unify and solve various machine learning tasks such as sparsification, avoiding catastrophic forgetting, and connecting local optima.
Contribution
The authors develop a novel differential geometry-based approach to analyze and optimize neural networks by traversing geodesic paths in weight space, unifying multiple objectives.
Findings
Effective network sparsification using geodesic paths.
Mitigation of catastrophic forgetting through high-performance network paths.
Discovery of high-accuracy routes connecting local optima.
Abstract
Machine learning problems have an intrinsic geometric structure as central objects including a neural network's weight space and the loss function associated with a particular task can be viewed as encoding the intrinsic geometry of a given machine learning problem. Therefore, geometric concepts can be applied to analyze and understand theoretical properties of machine learning strategies as well as to develop new algorithms. In this paper, we address three seemingly unrelated open questions in machine learning by viewing them through a unified framework grounded in differential geometry. Specifically, we view the weight space of a neural network as a manifold endowed with a Riemannian metric that encodes performance on specific tasks. By defining a metric, we can construct geodesic, minimum length, paths in weight space that represent sets of networks of equivalent or near equivalent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Domain Adaptation and Few-Shot Learning · 3D Shape Modeling and Analysis
