ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

Samar Khanna; Medhanie Irgau; David B. Lobell; Stefano Ermon

arXiv:2406.10973·cs.CV·January 1, 2026

ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

Samar Khanna, Medhanie Irgau, David B. Lobell, Stefano Ermon

PDF

Open Access 1 Models 1 Video

TL;DR

ExPLoRA introduces a parameter-efficient, self-supervised pre-training method for vision transformers that enhances domain adaptation, outperforming traditional fine-tuning approaches especially in resource-constrained settings.

Contribution

The paper proposes ExPLoRA, a novel technique combining self-supervised pre-training and LoRA-based fine-tuning to improve domain transfer of vision transformers with minimal additional parameters.

Findings

01

Achieves state-of-the-art results on satellite imagery domain.

02

Up to 8% improvement in linear probing accuracy.

03

Uses less than 10% of parameters compared to full fine-tuning.

Abstract

Parameter-efficient fine-tuning (PEFT) techniques such as low-rank adaptation (LoRA) can effectively adapt large pre-trained foundation models to downstream tasks using only a small fraction (0.1%-10%) of the original trainable weights. An under-explored question of PEFT is in extending the pre-training phase without supervised labels; that is, can we adapt a pre-trained foundation model to a new domain via efficient self-supervised pre-training on this domain? In this work, we introduce ExPLoRA, a highly effective technique to improve transfer learning of pre-trained vision transformers (ViTs) under domain shifts. Initializing a ViT with pre-trained weights on large, natural-image datasets such as from DinoV2 or MAE, ExPLoRA continues the unsupervised pre-training objective on a new domain, unfreezing 1-2 pre-trained ViT blocks and tuning all other layers with LoRA. We then fine-tune…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
samarkhanna/ExPLoRA
model· ♡ 2
♡ 2

Videos

ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

MethodsMasked autoencoder