Generalization Through the Lens of Learning Dynamics

Clare Lyle

arXiv:2212.05377·cs.LG·December 13, 2022

Generalization Through the Lens of Learning Dynamics

Clare Lyle

PDF

Open Access

TL;DR

This paper explores how learning dynamics influence the ability of deep neural networks, in supervised and reinforcement learning, to generalize effectively to new, unseen situations, addressing a key challenge in deploying reliable AI systems.

Contribution

It offers new insights into the role of learning dynamics in neural network generalization, bridging gaps in understanding for supervised and reinforcement learning.

Findings

01

Learning dynamics significantly impact generalization performance.

02

Deep neural networks can generalize well despite theoretical challenges.

03

Insights help improve the reliability of AI systems in real-world applications.

Abstract

A machine learning (ML) system must learn not only to match the output of a target function on a training set, but also to generalize to novel situations in order to yield accurate predictions at deployment. In most practical applications, the user cannot exhaustively enumerate every possible input to the model; strong generalization performance is therefore crucial to the development of ML systems which are performant and reliable enough to be deployed in the real world. While generalization is well-understood theoretically in a number of hypothesis classes, the impressive generalization performance of deep neural networks has stymied theoreticians. In deep reinforcement learning (RL), our understanding of generalization is further complicated by the conflict between generalization and stability in widely-used RL algorithms. This thesis will provide insight into generalization by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications