Learning Internal Representations (COLT 1995)
Jonathan Baxter

TL;DR
This paper introduces a method for automatically learning internal representations across multiple tasks to improve generalization and reduce the number of examples needed for new tasks, with theoretical guarantees and neural network training evidence.
Contribution
It proposes a novel framework for learning internal representations from many tasks, providing theoretical bounds on sample complexity and demonstrating neural network training for representation learning.
Findings
Sample complexity per task decreases with more tasks ($O(a+b/n)$)
Representation learning reduces examples needed for new tasks to $O(a)$
Neural network training supports theoretical results
Abstract
Probably the most important problem in machine learning is the preliminary biasing of a learner's hypothesis space so that it is small enough to ensure good generalisation from reasonable training sets, yet large enough that it contains a good solution to the problem being learnt. In this paper a mechanism for {\em automatically} learning or biasing the learner's hypothesis space is introduced. It works by first learning an appropriate {\em internal representation} for a learning environment and then using that representation to bias the learner's hypothesis space for the learning of future tasks drawn from the same environment. An internal representation must be learnt by sampling from {\em many similar tasks}, not just a single task as occurs in ordinary machine learning. It is proved that the number of examples {\em per task} required to ensure good generalisation from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
