Pseudodata-guided Invariant Representation Learning Boosts the Out-of-Distribution Generalization in Enzymatic Kinetic Parameter Prediction
Haomin Wu, Zhiwei Nie, Hongyu Zhang, Zhixiang Ren

TL;DR
This paper introduces O$^2$DENet, a novel module that improves the robustness and accuracy of enzyme kinetic parameter predictions on out-of-distribution data by using biologically informed perturbations and invariant representation learning.
Contribution
The paper presents O$^2$DENet, a plug-and-play module that enhances OOD generalization in enzyme-substrate interaction models through perturbation augmentation and invariant learning.
Findings
O$^2$DENet improves prediction accuracy on OOD benchmarks.
Achieves state-of-the-art robustness in enzyme kinetics prediction.
Enhances model stability for enzyme engineering applications.
Abstract
Accurate prediction of enzyme kinetic parameters is essential for understanding catalytic mechanisms and guiding enzyme engineering.However, existing deep learning-based enzyme-substrate interaction (ESI) predictors often exhibit performance degradation on sequence-divergent, out-of-distribution (OOD) cases, limiting robustness under biologically relevant perturbations.We propose ODENet, a lightweight, plug-and-play module that enhances OOD generalization via biologically and chemically informed perturbation augmentation and invariant representation learning.ODENet introduces enzyme-substrate perturbations and enforces consistency between original and augmented enzyme-substrate-pair representations to encourage invariance to distributional shifts.When integrated with representative ESI models, ODENet consistently improves predictive performance for both and …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Machine Learning in Materials Science · Protein Degradation and Inhibitors
