PEER: A Comprehensive and Multi-Task Benchmark for Protein Sequence Understanding
Minghao Xu, Zuobai Zhang, Jiarui Lu, Zhaocheng Zhu, Yangtian Zhang,, Chang Ma, Runcheng Liu, Jian Tang

TL;DR
PEER is a comprehensive multi-task benchmark for protein sequence understanding, evaluating various methods across diverse protein tasks and demonstrating the superiority of large-scale pre-trained models, with multi-task training enhancing performance.
Contribution
This paper introduces PEER, the first standardized multi-task benchmark for protein sequence understanding, enabling consistent evaluation of different deep learning approaches.
Findings
Pre-trained protein language models outperform traditional methods.
Multi-task learning improves performance across tasks.
Benchmark datasets and code are publicly available.
Abstract
We are now witnessing significant progress of deep learning methods in a variety of tasks (or datasets) of proteins. However, there is a lack of a standard benchmark to evaluate the performance of different methods, which hinders the progress of deep learning in this field. In this paper, we propose such a benchmark called PEER, a comprehensive and multi-task benchmark for Protein sEquence undERstanding. PEER provides a set of diverse protein understanding tasks including protein function prediction, protein localization prediction, protein structure prediction, protein-protein interaction prediction, and protein-ligand interaction prediction. We evaluate different types of sequence-based methods for each task including traditional feature engineering approaches, different sequence encoding methods as well as large-scale pre-trained protein language models. In addition, we also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Genomics and Phylogenetic Studies
