ProtSolM: Protein Solubility Prediction with Multi-modal Features
Yang Tan, Jia Zheng, Liang Hong, Bingxin Zhou

TL;DR
ProtSolM is a deep learning model that combines multi-modal features and large datasets to improve the accuracy of protein solubility prediction, advancing computational methods in protein engineering.
Contribution
It introduces a novel multi-modal deep learning approach that integrates physicochemical, sequence, and structural data, trained on the largest solubility dataset to date.
Findings
Achieved state-of-the-art performance on multiple evaluation metrics.
Demonstrated significant improvement over existing methods.
Provided a comprehensive leaderboard for protein solubility prediction.
Abstract
Understanding protein solubility is essential for their functional applications. Computational methods for predicting protein solubility are crucial for reducing experimental costs and enhancing the efficiency and success rates of protein engineering. Existing methods either construct a supervised learning scheme on small-scale datasets with manually processed physicochemical properties, or blindly apply pre-trained protein language models to extract amino acid interaction information. The scale and quality of available training datasets leave significant room for improvement in terms of accuracy and generalization. To address these research gaps, we propose \sol, a novel deep learning method that combines pre-training and fine-tuning schemes for protein solubility prediction. ProtSolM integrates information from multiple dimensions, including physicochemical properties, amino acid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein purification and stability · Computational Drug Discovery Methods · Crystallization and Solubility Studies
