Sparsely Grouped Input Variables for Neural Networks
Beibin Li, Nicholas Nuechterlein, Erin Barney, Caitlin Hudac, Pamela, Ventola, Linda Shapiro, Frederick Shic

TL;DR
This paper introduces a novel method for inducing group sparsity in neural networks by using a new loss function and optimization algorithm, effectively selecting relevant input variable groups in real-world datasets.
Contribution
It proposes a new regularization technique and optimization approach for finding sparse input variable groups in multi-layer neural networks, extending beyond linear models.
Findings
Successfully achieved group sparsity in three real-world datasets.
Excluded significant portions of input variables while maintaining performance.
Demonstrated effectiveness in genomic, eye-tracking, and image recognition tasks.
Abstract
In genomic analysis, biomarker discovery, image recognition, and other systems involving machine learning, input variables can often be organized into different groups by their source or semantic category. Eliminating some groups of variables can expedite the process of data acquisition and avoid over-fitting. Researchers have used the group lasso to ensure group sparsity in linear models and have extended it to create compact neural networks in meta-learning. Different from previous studies, we use multi-layer non-linear neural networks to find sparse groups for input variables. We propose a new loss function to regularize parameters for grouped input variables, design a new optimization algorithm for this loss function, and test these methods in three real-world settings. We achieve group sparsity for three datasets, maintaining satisfying results while excluding one nucleotide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
MethodsTest
