Unsupervised Finetuning

Suichan Li; Dongdong Chen; Yinpeng Chen; Lu Yuan; Lei; Zhang; Qi Chu; Bin Liu; Nenghai Yu

arXiv:2110.09510·cs.CV·October 19, 2021·6 cites

Unsupervised Finetuning

Suichan Li, Dongdong Chen, Yinpeng Chen, Lu Yuan, Lei, Zhang, Qi Chu, Bin Liu, Nenghai Yu

PDF

Open Access

TL;DR

This paper introduces unsupervised finetuning, a method to adapt pretrained models to small unlabeled target datasets by combining source data and target data, improving transfer performance.

Contribution

It proposes two novel strategies, sparse source data replaying and data mixing, to enhance unsupervised finetuning effectiveness.

Findings

01

Improved transfer performance over naive methods.

02

Source data is crucial for effective unsupervised finetuning.

03

Strategies are effective across multiple datasets.

Abstract

This paper studies "unsupervised finetuning", the symmetrical problem of the well-known "supervised finetuning". Given a pretrained model and small-scale unlabeled target data, unsupervised finetuning is to adapt the representation pretrained from the source domain to the target domain so that better transfer performance can be obtained. This problem is more challenging than the supervised counterpart, as the low data density in the small-scale target data is not friendly for unsupervised learning, leading to the damage of the pretrained representation and poor representation in the target domain. In this paper, we find the source data is crucial when shifting the finetuning paradigm from supervise to unsupervise, and propose two simple and effective strategies to combine source and target data into unsupervised finetuning: "sparse source data replaying", and "data mixing". The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Geophysical Methods and Applications · Speech Recognition and Synthesis