Sparsifying networks by traversing Geodesics

Guruprasad Raghavan; Matt Thomson

arXiv:2012.09605·cs.LG·December 18, 2020·1 cites

Sparsifying networks by traversing Geodesics

Guruprasad Raghavan, Matt Thomson

PDF

Open Access

TL;DR

This paper introduces a geometric framework to identify high-performance paths in neural network weight spaces, enabling effective sparsification and potentially addressing issues like catastrophic forgetting.

Contribution

It proposes a novel mathematical approach to evaluate geodesics in functional space, facilitating the transition from dense to sparse networks while maintaining performance.

Findings

01

Successful application on VGG-11 with CIFAR-10

02

Effective sparsification of MLPs on MNIST

03

Framework's versatility for various problems

Abstract

The geometry of weight spaces and functional manifolds of neural networks play an important role towards 'understanding' the intricacies of ML. In this paper, we attempt to solve certain open questions in ML, by viewing them through the lens of geometry, ultimately relating it to the discovery of points or paths of equivalent function in these spaces. We propose a mathematical framework to evaluate geodesics in the functional space, to find high-performance paths from a dense network to its sparser counterpart. Our results are obtained on VGG-11 trained on CIFAR-10 and MLP's trained on MNIST. Broadly, we demonstrate that the framework is general, and can be applied to a wide variety of problems, ranging from sparsification to alleviating catastrophic forgetting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Model Reduction and Neural Networks · Medical Imaging and Analysis