Enzyme promiscuity prediction using hierarchy-informed multi-label classification
Gian Marco Visani, Michael C. Hughes, Soha Hassoun

TL;DR
This paper introduces a hierarchy-informed multi-label neural network model for predicting enzyme promiscuity, leveraging hierarchical enzyme class data and inhibitor information to improve accuracy over other models.
Contribution
It presents a novel hierarchical multi-label neural network model, EPP-HMCNF, for enzyme promiscuity prediction that outperforms existing models and utilizes enzyme hierarchy and inhibitor data.
Findings
Hierarchical neural network outperforms other models
Inhibitor data improves prediction accuracy
Performance drops under realistic data splits
Abstract
As experimental efforts are costly and time consuming, computational characterization of enzyme capabilities is an attractive alternative. We present and evaluate several machine-learning models to predict which of 983 distinct enzymes, as defined via the Enzyme Commission, EC, numbers, are likely to interact with a given query molecule. Our data consists of enzyme-substrate interactions from the BRENDA database. Some interactions are attributed to natural selection and involve the enzyme's natural substrates. The majority of the interactions however involve non-natural substrates, thus reflecting promiscuous enzymatic activities. We frame this enzyme promiscuity prediction problem as a multi-label classification task. We maximally utilize inhibitor and unlabelled data to train prediction models that can take advantage of known hierarchical relationships between enzyme classes. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Bioinformatics · Chemical Synthesis and Analysis
