Transfer Learning Using Feature Selection
Paramveer S. Dhillon, Dean Foster, Lyle Ungar

TL;DR
This paper introduces three transfer learning methods based on the MDL principle to improve feature selection across multiple tasks, feature classes, and sequential transfer scenarios, demonstrating effectiveness in genomics and WSD.
Contribution
The paper presents novel transfer learning feature selection methods using the MDL principle, addressing simultaneous, class-based, and sequential transfer problems with Bayesian interpretation.
Findings
Effective in genomics datasets for small feature sets
Improves word sense disambiguation accuracy
Beneficial when tasks have unequal data amounts
Abstract
We present three related ways of using Transfer Learning to improve feature selection. The three methods address different problems, and hence share different kinds of information between tasks or feature classes, but all three are based on the information theoretic Minimum Description Length (MDL) principle and share the same underlying Bayesian interpretation. The first method, MIC, applies when predictive models are to be built simultaneously for multiple tasks (``simultaneous transfer'') that share the same set of features. MIC allows each feature to be added to none, some, or all of the task models and is most beneficial for selecting a small set of predictive features from a large pool of features, as is common in genomic and biological datasets. Our second method, TPC (Three Part Coding), uses a similar methodology for the case when the features can be divided into feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
