Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
Kaican Li, Weiyan Xie, Yongxiang Huang, Didan Deng, Lanqing Hong,, Zhenguo Li, Ricardo Silva, Nevin L. Zhang

TL;DR
This paper introduces dual risk minimization (DRM), a novel fine-tuning method that balances expected and worst-case risks to enhance robustness of foundation models against distribution shifts, achieving state-of-the-art results.
Contribution
The paper proposes DRM, a new fine-tuning approach that leverages core features and worst-case risk estimation to improve model robustness beyond existing methods.
Findings
DRM improves out-of-distribution accuracy on multiple benchmarks.
DRM achieves state-of-the-art performance in robustness tasks.
Utilizes core-feature descriptions from LLMs for risk estimation.
Abstract
Fine-tuning foundation models often compromises their robustness to distribution shifts. To remedy this, most robust fine-tuning methods aim to preserve the pre-trained features. However, not all pre-trained features are robust and those methods are largely indifferent to which ones to preserve. We propose dual risk minimization (DRM), which combines empirical risk minimization with worst-case risk minimization, to better preserve the core features of downstream tasks. In particular, we utilize core-feature descriptions generated by LLMs to induce core-based zero-shot predictions which then serve as proxies to estimate the worst-case risk. DRM balances two crucial aspects of model robustness: expected performance and worst-case performance, establishing a new state of the art on various real-world benchmarks. DRM significantly improves the out-of-distribution performance of CLIP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMedical Imaging Techniques and Applications · Nuclear reactor physics and engineering · Advanced Radiotherapy Techniques
MethodsContrastive Language-Image Pre-training
