Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models
Yang Tan, Mingchen Li, Bingxin Zhou, Bozitao Zhong, Lirong Zheng, Pan, Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong

TL;DR
SES-Adapter is a novel, scalable adapter method that enhances protein language models by integrating structural information, significantly improving downstream task performance and training efficiency across diverse datasets.
Contribution
The paper introduces SES-Adapter, a simple and scalable structure-aware adapter that effectively incorporates structural embeddings into PLMs for protein analysis, outperforming existing methods.
Findings
Up to 11% improvement in downstream task performance.
Training speed increased by up to 1034%.
Convergence rate doubled with SES-Adapter.
Abstract
Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfer to life science tasks is non-trivial due to the different training strategies and data forms. To address this gap, we introduce SES-Adapter, a simple, efficient, and scalable adapter method for enhancing the representation learning of PLMs. SES-Adapter incorporates PLM embeddings with structural sequence embeddings to create structure-aware representations. We show that the proposed method is compatible with different PLM architectures and across diverse tasks. Extensive evaluations are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Microbial Metabolic Engineering and Bioproduction
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Adapter
