Theoretical Models of Learning to Learn

Jonathan Baxter

arXiv:2002.12364·cs.LG·March 2, 2020·Learning to Learn·42 cites

Theoretical Models of Learning to Learn

Jonathan Baxter

PDF

Open Access

TL;DR

This paper introduces two theoretical models for learning to learn, demonstrating how machines can autonomously develop biases through exposure to related tasks within an environment.

Contribution

It presents the first formal PAC-type and hierarchical Bayes models for bias learning, advancing theoretical understanding of learning to learn.

Findings

01

PAC-type model based on empirical process theory

02

Hierarchical Bayes model for bias learning

03

Theoretical results demonstrating model capabilities

Abstract

A Machine can only learn if it is biased in some way. Typically the bias is supplied by hand, for example through the choice of an appropriate set of features. However, if the learning machine is embedded within an {\em environment} of related tasks, then it can {\em learn} its own bias by learning sufficiently many tasks from the environment. In this paper two models of bias learning (or equivalently, learning to learn) are introduced and the main theoretical results presented. The first model is a PAC-type model based on empirical process theory, while the second is a hierarchical Bayes model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Computability, Logic, AI Algorithms · Neural Networks and Applications