Self-supervised Pretraining of Cell Segmentation Models
Kaden Stillwagon, Alexandra Dunnum VandeLoo, Benjamin Magondu, Craig R. Forest

TL;DR
This paper introduces DINOCell, a self-supervised pretraining method for cell segmentation that improves performance by adapting representations from DINOv2 to microscopy data, outperforming models based on natural image priors.
Contribution
The paper presents a novel self-supervised framework that adapts DINOv2 representations to microscopy, enhancing cell segmentation accuracy and robustness under domain shift.
Findings
DINOCell achieves a SEG score of 0.784 on LIVECell, a 10.42% improvement over SAM-based models.
DINOCell demonstrates strong zero-shot performance on multiple microscopy datasets.
Self-supervised domain adaptation improves robustness of cell segmentation models.
Abstract
Instance segmentation enables the analysis of spatial and temporal properties of cells in microscopy images by identifying the pixels belonging to each cell. However, progress is constrained by the scarcity of high-quality labeled microscopy datasets. Many recent approaches address this challenge by initializing models with segmentation-pretrained weights from large-scale natural-image models such as Segment Anything Model (SAM). However, representations learned from natural images often encode objectness and texture priors that are poorly aligned with microscopy data, leading to degraded performance under domain shift. We propose DINOCell, a self-supervised framework for cell instance segmentation that leverages representations from DINOv2 and adapts them to microscopy through continued self-supervised training on unlabeled cell images prior to supervised fine-tuning. On the LIVECell…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
