# MoltiTox: a multimodal fusion model for molecular toxicity prediction

**Authors:** Junwoo Park, Sujee Lee

PMC · DOI: 10.3389/ftox.2025.1720651 · Frontiers in Toxicology · 2025-12-18

## TL;DR

MoltiTox is a new model that combines multiple types of molecular data to better predict toxicity in drug discovery.

## Contribution

Introduces a multimodal fusion model that integrates molecular graphs, SMILES, images, and NMR spectra for toxicity prediction.

## Key findings

- MoltiTox achieves a ROC-AUC of 0.831 on the Tox21 benchmark, outperforming single-modality models.
- Incorporating 13C NMR data provides additional chemical insights not captured by structure or language-based methods.

## Abstract

We introduce MoltiTox, a novel multimodal fusion model for molecular toxicity prediction, designed to overcome the limitations of single-modality approaches in drug discovery.

MoltiTox integrates four complementary data types: molecular graphs, SMILES strings, 2D images, and 13C NMR spectra. The model processes these inputs using four modality-specific encoders, including a GNN, a Transformer, a 2D CNN, and a 1D CNN. These heterogeneous embeddings are fused through an attention-based mechanism, enabling the model to capture complementary structural and chemical information from multiple molecular perspectives.

Evaluated on the Tox21 benchmark across 12 endpoints, MoltiTox achieves a ROC-AUC of 0.831, outperforming all single-modality baselines.

These findings highlight that integrating diverse molecular representations enhances both the robustness and generalizability of toxicity prediction models. Beyond predictive performance, the inclusion of 13C NMR data offers complementary chemical insights that are not fully captured by structure or language-based representations, suggesting its potential contribution to mechanistic understanding of molecular toxicity. By demonstrating how multimodal integration enriches molecular representations and enhances the interpretability of toxicity mechanisms, MoltiTox provides an extensible framework for developing more reliable models in computational toxicology.

## Full-text entities

- **Diseases:** toxicity (MESH:D064420)
- **Chemicals:** MoltiTox (-), 13C (MESH:C000615229)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12756560/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12756560/full.md

## References

62 references — full list in the complete paper: https://tomesphere.com/paper/PMC12756560/full.md

---
Source: https://tomesphere.com/paper/PMC12756560