SIMPLER: Efficient Foundation Model Adaptation via Similarity-Guided Layer Pruning for Earth Observation
V\'ictor Barreiro, Johannes Jakubik, Francisco Arg\"uello, Dora B. Heras

TL;DR
SIMPLER is a method that efficiently reduces model size and inference costs for Earth Observation foundation models by selecting redundant layers based on representation similarity, without additional training or hyperparameter tuning.
Contribution
It introduces a pre-fine-tuning layer pruning technique that leverages representation similarity to optimize model architecture before adaptation.
Findings
Prunes up to 79% of parameters with minimal performance loss.
Achieves 2.1x training and 2.6x inference speedups.
Generalizes across different models and modalities.
Abstract
Fine-tuning foundation models for Earth Observation is computationally expensive, with high training time and memory demands for both training and deployment. Parameter-efficient methods reduce training cost but retain full inference complexity, while post-hoc compression optimizes inference only after costly full fine-tuning. We introduce SIMPLER, a pre-fine-tuning architecture selection method that reduces inference and deployment costs by identifying an effective model depth before adaptation. SIMPLER exploits stabilization of representations in deeper layers of pre-trained vision transformers: it computes layer-wise representation similarity on unlabeled task data and applies an automated scoring function to select redundant layers, with no gradients, magnitude heuristics, or hyperparameter tuning required. On Prithvi-EO-2, SIMPLER prunes up to 79% of parameters while retaining 94%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
