Generalization of Self-Supervised Vision Transformers for Protein Localization Across Microscopy Domains

Ben Isselmann; Dilara G\"oksu; and Andreas Weinmann

arXiv:2602.05527·cs.CV·February 9, 2026

Generalization of Self-Supervised Vision Transformers for Protein Localization Across Microscopy Domains

Ben Isselmann, Dilara G\"oksu, and Andreas Weinmann

PDF

Open Access

TL;DR

This study demonstrates that self-supervised pretrained Vision Transformers can effectively transfer across different microscopy domains for protein localization, achieving high accuracy even with limited labeled data.

Contribution

It shows that domain-specific self-supervised pretraining enhances transferability of Vision Transformers across microscopy datasets, outperforming models trained directly on target data.

Findings

01

HPA-pretrained model achieved the highest macro F1-score of 0.8221.

02

All pretrained models transferred well across domains.

03

Domain-relevant SSL representations generalize effectively to related microscopy datasets.

Abstract

Task-specific microscopy datasets are often too small to train deep learning models that learn robust feature representations. Self-supervised learning (SSL) can mitigate this by pretraining on large unlabeled datasets, but it remains unclear how well such representations transfer across microscopy domains with different staining protocols and channel configurations. We investigate the cross-domain transferability of DINO-pretrained Vision Transformers for protein localization on the OpenCell dataset. We generate image embeddings using three DINO backbones pretrained on ImageNet-1k, the Human Protein Atlas (HPA), and OpenCell, and evaluate them by training a supervised classification head on OpenCell labels. All pretrained models transfer well, with the microscopy-specific HPA-pretrained model achieving the best performance (mean macro $F_{1}$ -score = 0.8221 $\pm$ 0.0062), slightly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCell Image Analysis Techniques · Machine Learning in Bioinformatics · Domain Adaptation and Few-Shot Learning