On the Power of Source Screening for Learning Shared Feature Extractors
Leo Muxing Wang, Connor Mclaughlin, Lili Su

TL;DR
This paper investigates how selecting the most relevant data sources can improve the learning of shared feature extractors, demonstrating that careful source screening can achieve optimal representation even with less data.
Contribution
It introduces the concept of source screening for shared feature learning, providing algorithms and heuristics to identify informative sources, and proves their effectiveness both theoretically and empirically.
Findings
Source screening can lead to minimax optimal subspace estimation.
Training on a selected subset of sources can match full data performance.
Algorithms effectively identify informative sources in practice.
Abstract
Learning with shared representation is widely recognized as an effective way to separate commonalities from heterogeneity across various heterogeneous sources. Most existing work includes all related data sources via simultaneously training a common feature extractor and source-specific heads. It is well understood that data sources with low relevance or poor quality may hinder representation learning. In this paper, we further dive into the question of which data sources should be learned jointly by focusing on the traditionally deemed ``good'' collection of sources, in which individual sources have similar relevance and qualities with respect to the true underlying common structure. Towards tractability, we focus on the linear setting where sources share a low-dimensional subspace. We find that source screening can play a central role in statistically optimal subspace estimation. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Mobile Crowdsensing and Crowdsourcing · Advanced Graph Neural Networks
