To update or not to update? Neurons at equilibrium in deep models

Andrea Bragagnolo; Enzo Tartaglione; Marco Grangetto

arXiv:2207.09455·cs.LG·November 15, 2022

To update or not to update? Neurons at equilibrium in deep models

Andrea Bragagnolo, Enzo Tartaglione, Marco Grangetto

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a neuron-centric approach called neuronal equilibrium (NEq) that determines when to update neurons in deep models, aiming to improve training efficiency and understanding of neuron behavior.

Contribution

It shifts focus from individual parameters to neuron behavior, proposing NEq to selectively update neurons based on their equilibrium state, enhancing training strategies.

Findings

01

NEq effectively identifies neurons at equilibrium to halt updates.

02

The approach adapts to different learning strategies and tasks.

03

Neuronal equilibrium varies with the learning setup.

Abstract

Recent advances in deep learning optimization showed that, with some a-posteriori information on fully-trained models, it is possible to match the same performance by simply training a subset of their parameters. Such a discovery has a broad impact from theory to applications, driving the research towards methods to identify the minimum subset of parameters to train without look-ahead information exploitation. However, the methods proposed do not match the state-of-the-art performance, and rely on unstructured sparsely connected models. In this work we shift our focus from the single parameters to the behavior of the whole neuron, exploiting the concept of neuronal equilibrium (NEq). When a neuron is in a configuration at equilibrium (meaning that it has learned a specific input-output relationship), we can halt its update; on the contrary, when a neuron is at non-equilibrium, we let…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eidoslab/neq
pytorchOfficial

Videos

To update or not to update? Neurons at equilibrium in deep models· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis