Machine Learning Prediction of Laccase‐Catalyzed Oxidation of Aromatic Compounds Using Curated Enzyme‐Specific Datasets
Yulia Kulagina, Christian Goldhahn, Ramon Weishaupt, Mark Schubert

TL;DR
This paper uses machine learning to predict which aromatic compounds can be oxidized by laccase enzymes, helping to speed up green chemistry experiments.
Contribution
The study introduces a machine learning framework with interpretable models and a visualization tool for predicting laccase-substrate compatibility.
Findings
Random forest models showed consistent performance across different laccase datasets.
ChemBERTa attention analysis identified molecular features linked to oxidation outcomes.
A lightweight tool was developed to visualize predictions on molecular structures.
Abstract
Laccases are multi‐copper oxidase enzymes that oxidize a wide range of aromatic and non‐aromatic compounds using molecular oxygen, producing water as the sole byproduct and making them attractive biocatalysts for green chemistry. However, the ability of laccases to oxidize specific substrates depends on a complex interplay of molecular structure, enzyme properties, redox potential, and environmental context, making laccase–substrate compatibility hard to predict. We apply machine learning models to pre‐screen laccase–substrate combinations, streamlining experimental workflows. We evaluate four classical classifiers and a transformer‐based model (ChemBERTa) on three in‐house curated datasets of aromatic substrates with oxidation profiles for distinct laccases. Overall, the tested models achieve comparable performance, with random forest (RFC) demonstrating more stability across different…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnzyme-mediated dye degradation · Electrochemical sensors and biosensors · Genomics and Phylogenetic Studies
