Playing the Lottery With Concave Regularizers for Sparse Trainable Neural Networks

Giulia Fracastoro; Sophie M. Fosson; Andrea Migliorati; Giuseppe C. Calafiore

arXiv:2501.11135·cs.LG·January 22, 2025

Open Access

TL;DR

This paper introduces a novel approach using concave regularizers to identify sparse subnetworks in neural networks, enhancing training efficiency and model sparsity beyond existing methods.

Contribution

It proposes a new class of lottery ticket algorithms leveraging concave regularization to better find effective sparse subnetworks in neural networks.

Findings

01 Improves performance over state-of-the-art algorithms

02 Theoretically effective in convex frameworks

03 Demonstrates success across various datasets and architectures

Abstract

The design of sparse neural networks, i.e., of networks with a reduced number of parameters, has been attracting increasing research attention in the last few years. The use of sparse models may significantly reduce the computational and storage footprint in the inference phase. In this context, the lottery ticket hypothesis (LTH) constitutes a breakthrough result that addresses the performance of not only the inference phase but also the training phase. It states that it is possible to extract effective sparse subnetworks, called winning tickets, that can be trained in isolation. The development of effective methods to play the lottery, i.e., to find winning tickets, is still an open problem. In this article, we propose a novel class of methods to play the lottery. The key point is the use of concave regularization to promote the sparsity of a relaxed binary mask, which represents…
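To illustrate why a concave penalty can favor sparse masks, here is a minimal NumPy sketch. It is not the authors' algorithm: the abstract does not name a specific regularizer, so the log penalty, the helper names (`log_penalty`, `log_penalty_grad`, `projected_step`), and the values of `eps`, `lam`, and `lr` are all assumptions chosen for illustration. The sketch compares two masks in [0, 1] with the same sum and shows that the concave penalty is lower for the sparse one, then applies one projected gradient step that pushes a near-zero entry exactly to zero.

```python
import numpy as np

def log_penalty(mask, eps=0.1):
    """Concave log penalty on a relaxed mask with entries in [0, 1].

    Assumed form (not taken from the paper): sum_i log(1 + m_i / eps).
    """
    return float(np.sum(np.log1p(mask / eps)))

def log_penalty_grad(mask, eps=0.1):
    """Gradient of log_penalty: 1 / (eps + m_i), largest for entries near zero."""
    return 1.0 / (eps + mask)

def projected_step(mask, loss_grad, lam=0.01, lr=0.1, eps=0.1):
    """One projected gradient step on the mask, clipped back to [0, 1].

    loss_grad stands in for the gradient of a task loss with respect to the mask.
    """
    updated = mask - lr * (loss_grad + lam * log_penalty_grad(mask, eps))
    return np.clip(updated, 0.0, 1.0)

# Two masks with the same sum (same l1 norm) but different sparsity.
sparse_mask = np.array([1.0, 0.0, 0.0, 0.0])
dense_mask = np.full(4, 0.25)

print("sums:", sparse_mask.sum(), dense_mask.sum())
print("log penalties:", log_penalty(sparse_mask), log_penalty(dense_mask))

# With a zero task-loss gradient, the penalty alone shrinks the mask;
# the entry that is already small gets clipped to exactly zero.
mask = np.array([0.5, 0.001])
print("after one step:", projected_step(mask, loss_grad=np.zeros(2)))
```

A convex l1 penalty assigns both masks the same cost, whereas the concave penalty separates them, which is the intuition behind using concave regularization to promote sparsity of a relaxed binary mask.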

Taxonomy

Topics: Neural Networks and Applications · Model Reduction and Neural Networks · Machine Learning and Data Classification

Methods: Softmax · Attention Is All You Need