Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models

Kaican Li; Weiyan Xie; Yongxiang Huang; Didan Deng; Lanqing Hong; Zhenguo Li; Ricardo Silva; Nevin L. Zhang

arXiv:2411.19757·cs.LG·December 2, 2024

TL;DR

This paper introduces dual risk minimization (DRM), a novel fine-tuning method that balances expected and worst-case risks to enhance robustness of foundation models against distribution shifts, achieving state-of-the-art results.

Contribution

The paper proposes DRM, a new fine-tuning approach that leverages core features and worst-case risk estimation to improve model robustness beyond existing methods.

Findings

01

DRM improves out-of-distribution accuracy on multiple benchmarks.

02

DRM achieves state-of-the-art performance in robustness tasks.

03

DRM uses core-feature descriptions generated by LLMs to estimate the worst-case risk.

Abstract

Fine-tuning foundation models often compromises their robustness to distribution shifts. To remedy this, most robust fine-tuning methods aim to preserve the pre-trained features. However, not all pre-trained features are robust and those methods are largely indifferent to which ones to preserve. We propose dual risk minimization (DRM), which combines empirical risk minimization with worst-case risk minimization, to better preserve the core features of downstream tasks. In particular, we utilize core-feature descriptions generated by LLMs to induce core-based zero-shot predictions which then serve as proxies to estimate the worst-case risk. DRM balances two crucial aspects of model robustness: expected performance and worst-case performance, establishing a new state of the art on various real-world benchmarks. DRM significantly improves the out-of-distribution performance of CLIP…
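The combination described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes the two risks are combined as a weighted sum, that the worst-case risk is approximated by a cross-entropy against soft labels obtained from core-based zero-shot predictions, and that those predictions arrive as plain logits. The names `dual_risk_loss`, `core_zero_shot_logits`, and `lam` are hypothetical and chosen only for this example.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(target_probs, logits):
    """Cross-entropy between a target distribution and softmax(logits)."""
    probs = softmax(logits)
    return -sum(t * math.log(p) for t, p in zip(target_probs, probs) if t > 0)


def dual_risk_loss(logits, label, core_zero_shot_logits, lam=1.0):
    """Illustrative per-example loss (an assumption, not the paper's exact form).

    logits: outputs of the model being fine-tuned.
    label: ground-truth class index, used for the empirical-risk term.
    core_zero_shot_logits: hypothetical zero-shot logits computed from
        core-feature descriptions, used as a proxy for the worst-case risk.
    lam: hypothetical weight balancing the two terms.
    """
    one_hot = [1.0 if k == label else 0.0 for k in range(len(logits))]
    empirical = cross_entropy(one_hot, logits)

    # Soft labels induced by the core-based zero-shot predictions.
    soft_targets = softmax(core_zero_shot_logits)
    worst_case_proxy = cross_entropy(soft_targets, logits)

    return empirical + lam * worst_case_proxy


loss = dual_risk_loss([2.0, 0.5, -1.0], label=0,
                      core_zero_shot_logits=[1.5, 0.2, 0.1], lam=0.5)
```

Under these assumptions, setting `lam=0` recovers ordinary empirical risk minimization, while larger values pull the fine-tuned predictions toward the core-based zero-shot predictions.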

Code & Models

Repositories

vaynexie/drm (PyTorch, official)

Videos

Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models · SlidesLive

Taxonomy

Topics: Medical Imaging Techniques and Applications · Nuclear reactor physics and engineering · Advanced Radiotherapy Techniques

Methods: Contrastive Language-Image Pre-training