RXNRECer Enables Fine-grained Enzymatic Function Annotation through Active Learning and Protein Language Models
Zhenkun Shi, Jun Zhu, Dehang Wang, BoYu Chen, Qianqian Yuan, Zhitao Mao, Fan Wei, Weining Wu, Xiaoping Liao, Hongwu Ma

TL;DR
RXNRECer is a transformer-based framework that directly predicts enzyme-catalyzed reactions using protein language models and active learning, improving accuracy and interpretability over traditional EC-based methods.
Contribution
It introduces a novel EC-free enzyme reaction prediction model combining protein language modeling and active learning, outperforming existing EC-based approaches.
Findings
16.54% F1 score improvement over baselines
15.43% accuracy gain in reaction prediction
Enables proteome-wide and promiscuity analysis
Abstract
A key challenge in enzyme annotation is identifying the biochemical reactions catalyzed by proteins. Most existing methods rely on Enzyme Commission (EC) numbers as intermediaries: they first predict an EC number and then retrieve the associated reactions. This indirect strategy introduces ambiguity due to the complex many-to-many mappings among proteins, EC numbers, and reactions, and is further complicated by frequent updates to EC numbers and inconsistencies across databases. To address these challenges, we present RXNRECer, a transformer-based ensemble framework that directly predicts enzyme-catalyzed reactions without relying on EC numbers. It integrates protein language modeling and active learning to capture both high-level sequence semantics and fine-grained transformation patterns. Evaluations on curated cross-validation and temporal test sets demonstrate consistent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Machine Learning in Materials Science · Microbial Metabolic Engineering and Bioproduction
