On Inductive Biases for Machine Learning in Data Constrained Settings

Gr\'egoire Mialon

arXiv:2302.10692·cs.LG·February 22, 2023

On Inductive Biases for Machine Learning in Data Constrained Settings

Gr\'egoire Mialon

PDF

Open Access 1 Repo

TL;DR

This paper investigates how incorporating known functions and inductive biases, especially from kernel methods, can improve learning in data-constrained scenarios, focusing on sequences, graphs, and sample efficiency.

Contribution

It introduces a framework replacing some neural modules with known functions to embed inductive biases, enhancing learning with limited data and analyzing sample requirements in convex models.

Findings

01

Effective use of known functions improves learning in data-scarce settings

02

Relationship between inductive biases and deep learning advances clarified

03

Sample screening techniques can identify informative data points efficiently

Abstract

Learning with limited data is one of the biggest problems of machine learning. Current approaches to this issue consist in learning general representations from huge amounts of data before fine-tuning the model on a small dataset of interest. While such technique, coined transfer learning, is very effective in domains such as computer vision or natural langage processing, it does not yet solve common problems of deep learning such as model interpretability or the overall need for data. This thesis explores a different answer to the problem of learning expressive models in data constrained settings: instead of relying on big datasets to learn neural networks, we will replace some modules by known functions reflecting the structure of the data. Very often, these functions will be drawn from the rich literature of kernel methods. Indeed, many kernels can reflect the underlying structure of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

GregoireMialon/screening_samples
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Neural Networks and Applications