Curvature-informed multi-task learning for graph networks
Alexander New, Michael J. Pekala, Nam Q. Le, Janna Domenico, Christine, D. Piatko, Christopher D. Stiles

TL;DR
This paper investigates why multi-task graph neural networks underperform compared to single-task models by analyzing the curvature of loss surfaces, proposing that differences in curvature hinder efficient learning across tasks.
Contribution
It introduces a spectral analysis method to assess loss surface curvature differences in multi-task learning for graph networks, informing better training strategies.
Findings
Loss surface curvature varies significantly across tasks.
Spectral properties of Hessians can diagnose learning inefficiencies.
Curvature analysis guides improved multi-task training methods.
Abstract
Properties of interest for crystals and molecules, such as band gap, elasticity, and solubility, are generally related to each other: they are governed by the same underlying laws of physics. However, when state-of-the-art graph neural networks attempt to predict multiple properties simultaneously (the multi-task learning (MTL) setting), they frequently underperform a suite of single property predictors. This suggests graph networks may not be fully leveraging these underlying similarities. Here we investigate a potential explanation for this phenomenon: the curvature of each property's loss surface significantly varies, leading to inefficient learning. This difference in curvature can be assessed by looking at spectral properties of the Hessians of each property's loss function, which is done in a matrix-free manner via randomized numerical linear algebra. We evaluate our hypothesis on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Neural Networks and Applications
