The Effects of Multi-Task Learning on ReLU Neural Network Functions
Julia Nakhleh, Joseph Shenouda, Robert D. Nowak

TL;DR
This paper reveals that multi-task shallow ReLU neural networks often produce solutions akin to kernel regression, showing unique solutions in certain cases and connecting neural network solutions to kernel methods and Sobolev spaces.
Contribution
It establishes a novel connection between multi-task neural networks and kernel methods, proving solution uniqueness and characterizing the solutions as minimum-norm problems in Sobolev spaces.
Findings
Solutions resemble kernel regression for each task
Multi-task solutions are almost always unique
Large number of tasks lead to Hilbert space minimization
Abstract
This paper studies the properties of solutions to multi-task shallow ReLU neural network learning problems, wherein the network is trained to fit a dataset with minimal sum of squared weights. Remarkably, the solutions learned for each individual task resemble those obtained by solving a kernel regression problem, revealing a novel connection between neural networks and kernel methods. It is known that single-task neural network learning problems are equivalent to a minimum norm interpolation problem in a non-Hilbertian Banach space, and that the solutions of such problems are generally non-unique. In contrast, we prove that the solutions to univariate-input, multi-task neural network interpolation problems are almost always unique, and coincide with the solution to a minimum-norm interpolation problem in a Sobolev (Reproducing Kernel) Hilbert Space. We also demonstrate a similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia?
