A Quantitative Approach to Predicting Representational Learning and Performance in Neural Networks

Ryan Pyle; Sebastian Musslick; Jonathan D. Cohen; and Ankit B. Patel

arXiv:2307.07575·cs.LG·July 18, 2023

Open Access

TL;DR

This paper introduces a pseudo-kernel-based method for analyzing and predicting the representations a neural network learns, and its resulting task performance, using only the network's initial conditions and training curriculum.

Contribution

It presents a novel analytical tool that predicts representational learning and multitask performance from initial network states and training curriculum.

Findings

01 The method predicts how the scale of weight initialization affects representational learning.

02 It predicts how the training curriculum affects multitask performance.

03 It is validated on a simple test case, then applied to a comparison of sequential single-task and concurrent multitask performance.

Abstract

A key property of neural networks (both biological and artificial) is how they learn to represent and manipulate input information in order to solve a task. Different types of representations may be suited to different types of tasks, making identifying and understanding learned representations a critical part of understanding and designing useful networks. In this paper, we introduce a new pseudo-kernel based tool for analyzing and predicting learned representations, based only on the initial conditions of the network and the training curriculum. We validate the method on a simple test case, before demonstrating its use on a question about the effects of representational learning on sequential single versus concurrent multitask performance. We show that our method can be used to predict the effects of the scale of weight initialization and training curriculum on representational…
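
As a loose illustration of what a kernel computed from initial conditions can look like, the sketch below builds the Gram matrix of hidden representations for a single linear layer at two initialization scales. This is not the paper's pseudo-kernel, whose definition is not given on this page; the function names, the one-hot inputs, the layer sizes, and the two scales are all assumptions chosen for illustration.

```python
import numpy as np


def init_weights(n_in, n_hidden, scale, seed=0):
    """Draw Gaussian input-to-hidden weights with standard deviation `scale`."""
    rng = np.random.default_rng(seed)
    return scale * rng.standard_normal((n_in, n_hidden))


def representation_kernel(X, W):
    """Gram matrix of hidden representations: K[i, j] = <h_i, h_j>, h = x @ W."""
    H = X @ W  # linear hidden layer, no nonlinearity (an assumption)
    return H @ H.T


# Four one-hot inputs standing in for four stimuli (hypothetical data).
X = np.eye(4)

# Same seed for both draws, so only the scale differs.
K_small = representation_kernel(X, init_weights(4, 16, scale=0.1))
K_large = representation_kernel(X, init_weights(4, 16, scale=1.0))

# Ratio of kernel magnitudes between the two initializations.
ratio = np.trace(K_large) / np.trace(K_small)
```

Because both weight matrices come from the same seed, the two kernels differ only by the factor (1.0 / 0.1)^2 = 100. For this linear layer, the initialization scale sets the kernel's magnitude at initialization but not its structure. The sketch does not model training, which is where the paper's predictions apply.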

Taxonomy

Topics: Neural Networks and Applications · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning