Protein Language Model Zero-Shot Fitness Predictions are Improved by Inference-only Dropout

Aditya Ravuri; Neil D. Lawrence

arXiv:2506.14793·cs.LG·June 19, 2025

Protein Language Model Zero-Shot Fitness Predictions are Improved by Inference-only Dropout

Aditya Ravuri, Neil D. Lawrence

PDF

Open Access

TL;DR

This paper demonstrates that applying dropout during inference in protein language models enhances zero-shot protein fitness prediction accuracy without retraining, using a simple averaging technique.

Contribution

Introducing inference-only dropout in PLMs improves zero-shot protein property predictions without retraining or fine-tuning.

Findings

01

Inference-only dropout increases prediction accuracy.

02

A dropout rate of 0.1 is generally effective.

03

Method works across multiple PLMs without retraining.

Abstract

Protein Language Models (PLMs) such as ESM2 have been shown to be capable of zero-shot prediction of critical scalar properties of proteins (fitness). In this work, we show that injecting a dropout layer at inference time between a PLM's featurizer/embedding layer and its transformer, and averaging its output akin to Monte-Carlo dropout increases zero-shot performance on a subset of the ProteinGym dataset. This is the case even when the model was not trained with dropouts to begin with, and does not require retraining or finetuning of the PLM. A dropout of 0.1 seems performant across all models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Bioinformatics · Topic Modeling · Genomics and Phylogenetic Studies