Pretraining and the Lasso

Erin Craig; Mert Pilanci; Thomas Le Menestrel; Balasubramanian; Narasimhan; Manuel Rivas; Stein-Erik Gullaksen; Roozbeh Dehghannasiri; Julia; Salzman; Jonathan Taylor; Robert Tibshirani

arXiv:2401.12911·stat.ME·October 31, 2024·1 cites

Pretraining and the Lasso

Erin Craig, Mert Pilanci, Thomas Le Menestrel, Balasubramanian, Narasimhan, Manuel Rivas, Stein-Erik Gullaksen, Roozbeh Dehghannasiri, Julia, Salzman, Jonathan Taylor, Robert Tibshirani

PDF

Open Access 1 Repo

TL;DR

This paper explores how pretraining techniques can enhance the performance of the lasso regression method by developing a framework for transfer learning from large to small datasets, improving support recovery.

Contribution

It introduces a novel framework for applying pretraining to the lasso, enabling effective transfer learning and support recovery across related datasets and models.

Findings

01

Pretraining improves support recovery of common coefficients.

02

The framework applies to stratified, multinomial, and multi-response models.

03

Pretraining enhances scientific interpretability of model coefficients.

Abstract

Pretraining is a popular and powerful paradigm in machine learning to pass information from one model to another. As an example, suppose one has a modest-sized dataset of images of cats and dogs, and plans to fit a deep neural network to classify them from the pixel features. With pretraining, we start with a neural network trained on a large corpus of images, consisting of not just cats and dogs but hundreds of other image types. Then we fix all of the network weights except for the top layer(s) (which makes the final classification) and train (or "fine tune") those weights on our dataset. This often results in dramatically better performance than the network trained solely on our smaller dataset. In this paper, we ask the question "Can pretraining help the lasso?". We develop a framework for the lasso in which a model is fit to a large dataset, and then fine-tuned using a smaller…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

erincr/ptLasso
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms