When few labeled target data suffice: a theory of semi-supervised domain adaptation via fine-tuning from multiple adaptive starts

Wooseok Ha; Yuansi Chen

arXiv:2507.14661·stat.ML·July 22, 2025

When few labeled target data suffice: a theory of semi-supervised domain adaptation via fine-tuning from multiple adaptive starts

Wooseok Ha, Yuansi Chen

PDF

TL;DR

This paper develops a theoretical framework for semi-supervised domain adaptation (SSDA) using structural causal models, introduces the MASFT algorithm for effective fine-tuning from multiple starts, and demonstrates near-optimal performance with limited labeled target data.

Contribution

It provides a new theoretical analysis of SSDA, proposes the MASFT algorithm for practical adaptation, and extends UDA methods to achieve minimax-optimal results with minimal labeled target data.

Findings

01

MASFT achieves near-optimal target performance across various distributional shifts.

02

Extending UDA methods to SSDA under different assumptions is theoretically justified.

03

Empirical validation confirms the effectiveness of the proposed methods.

Abstract

Semi-supervised domain adaptation (SSDA) aims to achieve high predictive performance in the target domain with limited labeled target data by exploiting abundant source and unlabeled target data. Despite its significance in numerous applications, theory on the effectiveness of SSDA remains largely unexplored, particularly in scenarios involving various types of source-target distributional shifts. In this work, we develop a theoretical framework based on structural causal models (SCMs) which allows us to analyze and quantify the performance of SSDA methods when labeled target data is limited. Within this framework, we introduce three SSDA methods, each having a fine-tuning strategy tailored to a distinct assumption about the source and target relationship. Under each assumption, we demonstrate how extending an unsupervised domain adaptation (UDA) method to SSDA can achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.