The Effects of Multi-Task Learning on ReLU Neural Network Functions

Julia Nakhleh; Joseph Shenouda; Robert D. Nowak

arXiv:2410.21696·stat.ML·February 28, 2025

The Effects of Multi-Task Learning on ReLU Neural Network Functions

Julia Nakhleh, Joseph Shenouda, Robert D. Nowak

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper reveals that multi-task shallow ReLU neural networks often produce solutions akin to kernel regression, showing unique solutions in certain cases and connecting neural network solutions to kernel methods and Sobolev spaces.

Contribution

It establishes a novel connection between multi-task neural networks and kernel methods, proving solution uniqueness and characterizing the solutions as minimum-norm problems in Sobolev spaces.

Findings

01

Solutions resemble kernel regression for each task

02

Multi-task solutions are almost always unique

03

Large number of tasks lead to Hilbert space minimization

Abstract

This paper studies the properties of solutions to multi-task shallow ReLU neural network learning problems, wherein the network is trained to fit a dataset with minimal sum of squared weights. Remarkably, the solutions learned for each individual task resemble those obtained by solving a kernel regression problem, revealing a novel connection between neural networks and kernel methods. It is known that single-task neural network learning problems are equivalent to a minimum norm interpolation problem in a non-Hilbertian Banach space, and that the solutions of such problems are generally non-unique. In contrast, we prove that the solutions to univariate-input, multi-task neural network interpolation problems are almost always unique, and coincide with the solution to a minimum-norm interpolation problem in a Sobolev (Reproducing Kernel) Hilbert Space. We also demonstrate a similar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

joeshenouda/effects-mtl-nns
pytorchOfficial

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia?