Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained   Models

Byounggyu Lew; Donghyun Son; Buru Chang

arXiv:2302.01497·cs.LG·September 12, 2023

Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models

Byounggyu Lew, Donghyun Son, Buru Chang

PDF

Open Access

TL;DR

This paper introduces a novel domain generalization method that estimates unobservable gradients to improve unseen domain performance using pre-trained models, balancing task-specific learning and generalization.

Contribution

It proposes a new gradient estimation technique that enables pre-trained models to learn task-specific knowledge without losing their generalization ability.

Findings

01

Outperforms baseline methods on DomainBed benchmark.

02

Effectively learns task-specific knowledge without sacrificing generalization.

03

Demonstrates the effectiveness of unobservable gradient estimation in domain generalization.

Abstract

Domain generalization aims to build generalized models that perform well on unseen domains when only source domains are available for model optimization. Recent studies have shown that large-scale pre-trained models can enhance domain generalization by leveraging their generalization power. However, these pre-trained models lack target task-specific knowledge yet due to discrepancies between the pre-training objectives and the target task. Although the task-specific knowledge could be learned from source domains by fine-tuning, this hurts the generalization power of pre-trained models due to gradient bias toward the source domains. To alleviate this problem, we propose a new domain generalization method that estimates unobservable gradients that reduce potential risks in unseen domains using a large-scale pre-trained model. These estimated unobservable gradients allow the pre-trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Topic Modeling