NOCTIS: open-source toolkit that turns reaction data into actionable graph networks
Nataliya Lopanitsyna, Marta Pasquini, Marco Stenta

TL;DR
NOCTIS is an open-source toolkit that converts chemical reaction data into graph networks to help design efficient synthetic routes.
Contribution
NOCTIS introduces a modular, open-source framework for constructing and analyzing reaction graphs with route enumeration capabilities.
Findings
NOCTIS supports large-scale reaction data analysis using graph-based methods and parallel processing.
The plugin enables exhaustive synthetic route enumeration, reducing redundant exploration.
The toolkit is demonstrated using the MIT USPTO-480k dataset, showcasing route mining and network analysis.
Abstract
Chemical reactions form densely connected networks, and exploring these networks is essential for designing efficient and sustainable synthetic routes. As reaction data from literature, patents, and high-throughput experimentation continue to grow, so does the need for tools that can navigate and mine these large-scale datasets. Graph-based representations capture the topology of reaction space, yet few open-source tools exist for building and querying such networks. To address this, we developed NOCTIS, an open-source toolkit for constructing and analyzing reaction data as graphs. NOCTIS is an open-source Python package for building Networks of Organic Chemistry (NOCs) from reaction strings. It supports graph-based analysis, parallel processing of large datasets, and export to common Python formats (e.g., NetworkX, pandas). Built on Neo4j technology, it features a modular, extensible…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Bioinformatics and Genomic Networks · Graph Theory and Algorithms
