Masked Autoencoders are Scalable Learners of Cellular Morphology

Oren Kraus; Kian Kenyon-Dean; Saber Saberian; Maryam Fallah; Peter McLean; Jess Leung; Vasudev Sharma; Ayla Khan; Jia Balakrishnan; Safiye Celik; Maciej Sypetkowski; Chi Vicky Cheng; Kristen Morse; Maureen Makes; Ben Mabey; Berton Earnshaw

arXiv:2309.16064 · cs.CV · November 29, 2023 · 6 cites

TL;DR

This paper demonstrates that large-scale self-supervised masked autoencoders, especially ViT-based models, significantly improve the inference of biological relationships from cellular microscopy images compared with weakly supervised baselines.

Contribution

The paper shows that scaling up masked autoencoders, in both model size and training-set size, on large microscopy datasets improves how well they capture biological signal, and that they outperform weakly supervised baselines.

Findings

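01 A ViT-L/8 trained on over 3.5 billion unique crops, sampled from 93 million microscopy images, achieves relative improvements of up to 28% over the best weakly supervised baseline.

02 Both CNN- and ViT-based masked autoencoders significantly outperform weakly supervised baselines at inferring known biological relationships.

03 Training larger models on larger microscopy datasets improves the inference of known biological relationships.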
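
The 28% figure in the first finding is, per the abstract, a relative improvement over the best weakly supervised baseline rather than an absolute gain. A minimal sketch of that arithmetic follows; the function name and the two scores are hypothetical, chosen only for illustration, and are not results from the paper.

```python
def relative_improvement(model_score: float, baseline_score: float) -> float:
    """Return the fractional gain of model_score over baseline_score."""
    if baseline_score == 0:
        raise ValueError("baseline_score must be non-zero")
    return (model_score - baseline_score) / baseline_score


# Hypothetical benchmark scores, NOT taken from the paper.
baseline = 0.50
model = 0.64
print(f"relative improvement: {relative_improvement(model, baseline):.0%}")
```

With these made-up numbers, the absolute gain is 14 points while the relative gain is 28%, which is why the two should not be read interchangeably.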

Abstract

Inferring biological relationships from cellular phenotypes in high-content microscopy screens provides significant opportunity and challenge in biological research. Prior results have shown that deep vision models can capture biological signal better than hand-crafted features. This work explores how self-supervised deep learning approaches scale when training larger models on larger microscopy datasets. Our results show that both CNN- and ViT-based masked autoencoders significantly outperform weakly supervised baselines. At the high end of our scale, a ViT-L/8 trained on over 3.5 billion unique crops sampled from 93 million microscopy images achieves relative improvements as high as 28% over our best weakly supervised baseline at inferring known biological relationships curated from public databases. Relevant code and select models released with this work can be found at:…
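
To make the masked-autoencoder setup concrete, here is a minimal NumPy sketch of the random patch masking step that such models train on. It is an illustration under stated assumptions, not the authors' implementation. The patch size of 8 follows the "/8" in ViT-L/8; the 6-channel 256×256 crop and the 75% mask ratio are assumptions (the ratio is the default from the original MAE work and is not stated on this page), and `random_patch_mask` is a hypothetical helper name.

```python
import numpy as np


def random_patch_mask(image, patch_size=8, mask_ratio=0.75, rng=None):
    """Split a (C, H, W) image into non-overlapping patches and hide a random subset.

    Returns (visible_patches, mask): visible_patches has one flattened patch per
    row, and mask[i] is True where patch i is hidden from the encoder.
    """
    rng = np.random.default_rng() if rng is None else rng
    c, h, w = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("height and width must be divisible by patch_size")
    gh, gw = h // patch_size, w // patch_size

    # (C, gh, p, gw, p) -> (gh, gw, C, p, p) -> (num_patches, C * p * p)
    patches = (
        image.reshape(c, gh, patch_size, gw, patch_size)
        .transpose(1, 3, 0, 2, 4)
        .reshape(gh * gw, c * patch_size * patch_size)
    )

    num_patches = gh * gw
    num_masked = int(round(mask_ratio * num_patches))
    order = rng.permutation(num_patches)  # random patch order
    mask = np.zeros(num_patches, dtype=bool)
    mask[order[:num_masked]] = True  # the first num_masked patches are hidden
    return patches[~mask], mask


# Assumed crop shape for illustration: 6 channels, 256 x 256 pixels.
crop = np.random.default_rng(0).random((6, 256, 256), dtype=np.float32)
visible, mask = random_patch_mask(crop, rng=np.random.default_rng(1))
print(visible.shape, int(mask.sum()))
```

In an MAE, only the visible patches are passed to the encoder, and a lightweight decoder is trained to reconstruct the pixels of the hidden ones. With 8×8 patches on a 256×256 crop there are 1,024 patches, so a 75% ratio leaves 256 visible.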

Code & Models

Repositories

recursionpharma/maes_microscopy (PyTorch, official)

Taxonomy

Topics: Cell Image Analysis Techniques · Image Processing Techniques and Applications · Digital Imaging for Blood Diseases