Fused Gromov-Wasserstein Contrastive Learning for Effective Enzyme-Reaction Screening
Gengmo Zhou, Feng Yu, Wenda Wang, Zhifeng Gao, Guolin Ke, Zhewei Wei, Zhen Wang

TL;DR
This paper introduces FGW-CLIP, a contrastive learning framework based on fused Gromov-Wasserstein distance, which effectively improves enzyme-reaction screening by capturing hierarchical relationships and integrating information across domains.
Contribution
The paper presents a novel contrastive learning method using fused Gromov-Wasserstein distance that incorporates multiple alignments and regularization for enhanced enzyme-reaction screening.
Findings
Achieves state-of-the-art performance on EnzymeMap benchmark.
Outperforms existing methods across all ReactZyme splits.
Demonstrates robust generalization to novel enzymes and reactions.
Abstract
Enzymes are crucial catalysts that enable a wide range of biochemical reactions. Efficiently identifying specific enzymes from vast protein libraries is essential for advancing biocatalysis. Traditional computational methods for enzyme screening and retrieval are time-consuming and resource-intensive. Recently, deep learning approaches have shown promise. However, these methods focus solely on the interaction between enzymes and reactions, overlooking the inherent hierarchical relationships within each domain. To address these limitations, we introduce FGW-CLIP, a novel contrastive learning framework based on optimizing the fused Gromov-Wasserstein distance. FGW-CLIP incorporates multiple alignments, including inter-domain alignment between reactions and enzymes and intra-domain alignment within enzymes and reactions. By introducing a tailored regularization term, our method minimizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Bioinformatics and Genomic Networks · Machine Learning in Materials Science
