Learning Internal Representations (PhD Thesis)
Jonathan Baxter

TL;DR
This thesis explores how learning shared representations across multiple tasks improves generalization in machine learning, providing theoretical bounds on sample efficiency and extending to hypothesis space biases.
Contribution
It introduces the concept of environment-based representation learning and derives bounds on sample sizes needed for effective generalization across tasks.
Findings
Learning a shared representation reduces the number of examples needed per task.
Bounds are established for the number of tasks and samples for effective representation learning.
Representation learning can be extended to other forms of hypothesis space bias.
Abstract
Most machine learning theory and practice is concerned with learning a single task. In this thesis it is argued that in general there is insufficient information in a single task for a learner to generalise well and that what is required for good generalisation is information about many similar learning tasks. Similar learning tasks form a body of prior information that can be used to constrain the learner and make it generalise better. Examples of learning scenarios in which there are many similar tasks are handwritten character recognition and spoken word recognition. The concept of the environment of a learner is introduced as a probability measure over the set of learning problems the learner might be expected to learn. It is shown how a sample from the environment may be used to learn a representation, or recoding of the input space that is appropriate for the environment.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Machine Learning and Data Classification
