Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models
Byounggyu Lew, Donghyun Son, Buru Chang

TL;DR
This paper introduces a novel domain generalization method that estimates unobservable gradients to improve unseen domain performance using pre-trained models, balancing task-specific learning and generalization.
Contribution
It proposes a new gradient estimation technique that enables pre-trained models to learn task-specific knowledge without losing their generalization ability.
Findings
Outperforms baseline methods on DomainBed benchmark.
Effectively learns task-specific knowledge without sacrificing generalization.
Demonstrates the effectiveness of unobservable gradient estimation in domain generalization.
Abstract
Domain generalization aims to build generalized models that perform well on unseen domains when only source domains are available for model optimization. Recent studies have shown that large-scale pre-trained models can enhance domain generalization by leveraging their generalization power. However, these pre-trained models lack target task-specific knowledge yet due to discrepancies between the pre-training objectives and the target task. Although the task-specific knowledge could be learned from source domains by fine-tuning, this hurts the generalization power of pre-trained models due to gradient bias toward the source domains. To alleviate this problem, we propose a new domain generalization method that estimates unobservable gradients that reduce potential risks in unseen domains using a large-scale pre-trained model. These estimated unobservable gradients allow the pre-trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Topic Modeling
