Mixup for Test-Time Training

Bochao Zhang; Rui Shao; Jingda Du; PC Yuen

arXiv:2210.01640·cs.LG·October 5, 2022

Mixup for Test-Time Training

Bochao Zhang, Rui Shao, Jingda Du, PC Yuen

PDF

Open Access

TL;DR

This paper introduces MixTTT, a mixup-based regularization technique for test-time training that mitigates overfitting and mismatch issues, improving domain adaptation performance.

Contribution

It proposes a novel mixup-based regularization method for test-time training, addressing overfitting and mismatch problems, and providing theoretical analysis of its benefits.

Findings

01

MixTTT improves test-time training performance across multiple benchmarks.

02

Theoretical analysis shows MixTTT alleviates mismatch between model parts.

03

Experimental results confirm effectiveness of MixTTT as an add-on module.

Abstract

Test-time training provides a new approach solving the problem of domain shift. In its framework, a test-time training phase is inserted between training phase and test phase. During test-time training phase, usually parts of the model are updated with test sample(s). Then the updated model will be used in the test phase. However, utilizing test samples for test-time training has some limitations. Firstly, it will lead to overfitting to the test-time procedure thus hurt the performance on the main task. Besides, updating part of the model without changing other parts will induce a mismatch problem. Thus it is hard to perform better on the main task. To relieve above problems, we propose to use mixup in test-time training (MixTTT) which controls the change of model's parameters as well as completing the test-time procedure. We theoretically show its contribution in alleviating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Vision and Imaging · Human Pose and Action Recognition

MethodsTest · Mixup