Transfer Learning Using Feature Selection

Paramveer S. Dhillon; Dean Foster; Lyle Ungar

arXiv:0905.4022 · cs.LG · May 26, 2009 · 1 citation



TL;DR

This paper introduces three transfer learning methods, all based on the Minimum Description Length (MDL) principle, that improve feature selection in three settings: simultaneous transfer across multiple tasks, transfer across feature classes, and sequential transfer. It reports effectiveness on genomics data and word sense disambiguation (WSD).

Contribution

The paper presents novel MDL-based feature selection methods for transfer learning that address simultaneous, feature-class-based, and sequential transfer problems and share a common Bayesian interpretation.

Findings

01

Most beneficial for selecting a small set of predictive features from a large pool, as is common in genomic datasets

02

Improves word sense disambiguation accuracy

03

Beneficial when tasks have unequal data amounts

Abstract

We present three related ways of using Transfer Learning to improve feature selection. The three methods address different problems, and hence share different kinds of information between tasks or feature classes, but all three are based on the information theoretic Minimum Description Length (MDL) principle and share the same underlying Bayesian interpretation. The first method, MIC, applies when predictive models are to be built simultaneously for multiple tasks ("simultaneous transfer") that share the same set of features. MIC allows each feature to be added to none, some, or all of the task models and is most beneficial for selecting a small set of predictive features from a large pool of features, as is common in genomic and biological datasets. Our second method, TPC (Three Part Coding), uses a similar methodology for the case when the features can be divided into feature…
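To make the idea of simultaneous transfer concrete, the following is a minimal sketch of a greedy MDL-style selector in which a feature may enter none, some, or all of the task models. It is an illustration only, not the authors' MIC algorithm: the coding costs (log2 of the feature count to name a feature, log2 of the task count plus log2 of a binomial coefficient to name the task subset, and a flat `coef_bits` per coefficient), the fixed-variance Gaussian noise model, and the names `residual_bits` and `select_shared_features` are all assumptions made for this example.

```python
import math
import numpy as np


def residual_bits(y, X, cols, sigma2):
    """Bits needed to code the residuals of a least-squares fit on `cols`.

    Assumes Gaussian noise with fixed variance `sigma2`; the constant term
    is dropped because only differences between models matter here.
    """
    if cols:
        A = X[:, cols]
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)
        resid = y - A @ coef
    else:
        resid = y
    return float(resid @ resid) / (2.0 * sigma2 * math.log(2.0))


def select_shared_features(X, Y, sigma2=1.0, coef_bits=2.0):
    """Greedy MDL-style selection; returns one list of feature indices per task."""
    n_features = X.shape[1]
    n_tasks = Y.shape[1]
    selected = [[] for _ in range(n_tasks)]
    cost = [residual_bits(Y[:, t], X, [], sigma2) for t in range(n_tasks)]

    while True:
        best_net, best_update = 0.0, None
        for j in range(n_features):
            # Saving in data-coding bits if feature j joins each task model.
            gains = []
            for t in range(n_tasks):
                if j in selected[t]:
                    continue
                new_cost = residual_bits(Y[:, t], X, selected[t] + [j], sigma2)
                gains.append((cost[t] - new_cost, t, new_cost))
            gains.sort(key=lambda g: g[0], reverse=True)

            # Try the feature in its best 1, 2, ..., k tasks.
            total_gain = 0.0
            for k, (gain, _, _) in enumerate(gains, start=1):
                total_gain += gain
                model_bits = (
                    math.log2(n_features)                # which feature
                    + math.log2(n_tasks)                 # how many tasks (k)
                    + math.log2(math.comb(n_tasks, k))   # which k tasks
                    + k * coef_bits                      # one coefficient per task
                )
                net = total_gain - model_bits
                if net > best_net:
                    best_net, best_update = net, (j, gains[:k])

        # Stop when no addition shortens the total description length.
        if best_update is None:
            return selected
        j, chosen = best_update
        for _, t, new_cost in chosen:
            selected[t].append(j)
            cost[t] = new_cost
```

In this sketch the cost of naming a feature is paid once per addition, however many tasks receive it, so a feature that helps several tasks is cheaper per task than one that helps a single task. That is the sense in which information is shared across tasks here; the paper's actual coding scheme may differ.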


Taxonomy

Topics: Face and Expression Recognition · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification