# An artificial intelligence-driven synthesis planning platform (PhotoCat) for photocatalysis

**Authors:** Jiangcheng Xu, Silong Zhai, Panyi Huang, Wenbo Yu, Qingyi Mao, Kui Du, Weike Su, Bin Sun, Can Jin, An Su

PMC · DOI: 10.1038/s42004-026-01894-y · Communications Chemistry · 2026-01-21

## TL;DR

The paper introduces PhotoCatDB and PhotoCat, an AI platform that improves photocatalytic reaction prediction and accelerates sustainable chemistry discovery.

## Contribution

The novel contribution is the creation of a large annotated database and a high-accuracy AI platform for photocatalysis.

## Key findings

- PhotoCat achieves 82.6% accuracy in photocatalytic reaction prediction.
- The platform experimentally validated four novel photocatalytic reactions with yields up to 75.3%.

## Abstract

While photocatalysis has emerged as a transformative tool in modern synthesis, AI-assisted reaction prediction faces significant challenges due to data limitations. We present PhotoCatDB - a curated, open-source database containing 26.7 K photocatalytic reactions with detailed mechanistic annotations, including 9.2 K multicomponent transformations. Leveraging this resource alongside 100 million molecular data points, we developed PhotoCat, a Transformer-based platform that achieves unprecedented accuracy in photocatalytic reaction prediction (82.6%), retrosynthesis (77.1%), and condition recommendation (88.5%). The platform’s capabilities were experimentally validated through the discovery of four novel photocatalytic reactions with yields up to 75.3%. This integrated approach establishes a new paradigm for data-driven innovation in photocatalysis, bridging computational prediction with experimental validation to accelerate discovery in sustainable chemistry.

Photocatalysis holds promise for sustainable synthesis, but AI-driven reaction prediction is hindered by limited data. Here, the authors introduce PhotoCatDB, a comprehensive database of 26.7 K reactions, and develop PhotoCat, a transformer-based tool achieving high accuracy in photocatalytic reaction prediction, retrosynthesis, and condition recommendation, as well as experimentally validate the newly discovered photocatalytic reactions.

## Full-text entities

- **Genes:** TOP1 (DNA topoisomerase I) [NCBI Gene 7150] {aka TOPI}
- **Chemicals:** water (MESH:D014867), Pd (MESH:D010165), benzene (MESH:D001554), azobenzene (MESH:C009850), cesium carbonate (MESH:C545311), TMAP (MESH:C067366), cinnamic acid (MESH:C029010), petroleum ether (MESH:C004544), alkene (MESH:D000475), Lewis acids (MESH:D058116), EDA (MESH:C564336), NaCN (MESH:D012966), aldehyde (MESH:D000447), boron (MESH:D001895), silica gel (MESH:D058428), pyruvic acid (MESH:D019289), singlet oxygen (MESH:D026082), acid (MESH:D000143), O (MESH:D010100), nitrobenzene (MESH:C036077), CH2Cl2 (MESH:D008752), N (MESH:D009584), benzaldehyde (MESH:C032175), tetramethylsilane (MESH:C073196), ketone (MESH:D007659), trifluoromethanesulfonic acid (MESH:C012077), AgNO3 (MESH:D012835), C (MESH:D002244), Co (MESH:D003035), AlCl3 (MESH:D000077410), THF (MESH:C018674), Eosin Y (MESH:D004801), DMSO (MESH:D004121), Base (-), superoxide (MESH:D013481), ethyl acetate (MESH:C007650), ruthenium (MESH:D012428), carbohydrate (MESH:D002241), K2S2O8 (MESH:C009007)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12913628/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12913628/full.md

## References

6 references — full list in the complete paper: https://tomesphere.com/paper/PMC12913628/full.md

---
Source: https://tomesphere.com/paper/PMC12913628