On Inductive Biases for Machine Learning in Data Constrained Settings
Gr\'egoire Mialon

TL;DR
This paper investigates how incorporating known functions and inductive biases, especially from kernel methods, can improve learning in data-constrained scenarios, focusing on sequences, graphs, and sample efficiency.
Contribution
It introduces a framework replacing some neural modules with known functions to embed inductive biases, enhancing learning with limited data and analyzing sample requirements in convex models.
Findings
Effective use of known functions improves learning in data-scarce settings
Relationship between inductive biases and deep learning advances clarified
Sample screening techniques can identify informative data points efficiently
Abstract
Learning with limited data is one of the biggest problems of machine learning. Current approaches to this issue consist in learning general representations from huge amounts of data before fine-tuning the model on a small dataset of interest. While such technique, coined transfer learning, is very effective in domains such as computer vision or natural langage processing, it does not yet solve common problems of deep learning such as model interpretability or the overall need for data. This thesis explores a different answer to the problem of learning expressive models in data constrained settings: instead of relying on big datasets to learn neural networks, we will replace some modules by known functions reflecting the structure of the data. Very often, these functions will be drawn from the rich literature of kernel methods. Indeed, many kernels can reflect the underlying structure of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Neural Networks and Applications
