# Systematic analysis of hepatotoxicity: combining literature mining and AI language models

**Authors:** Chris Bauer, Long Tran Duc Dang, Twan van den Beucken, Johannes Schuchhardt, Ralf Herwig

PMC · DOI: 10.3389/frai.2025.1561292 · Frontiers in Artificial Intelligence · 2025-07-21

## TL;DR

This paper uses AI and text mining to automatically identify compounds that may cause liver damage, improving toxicity assessment efficiency.

## Contribution

Combines text mining, word embeddings, and large language models to assess hepatotoxicity with high accuracy.

## Key findings

- Text mining achieved an AUC of 0.8 in DILI validation.
- Large language models outperformed text mining with an AUC of 0.85.
- Combining methods improved performance to an AUC of 0.87.

## Abstract

The body of toxicological knowledge and literature is expanding at an accelerating pace. This rapid growth presents significant challenges for researchers, who must stay abreast with latest studies while also synthesizing the vast amount of published information.

Our goal is to automatically identify potential hepatoxicants from over 50,000 compounds using the wealth of scientific publications and knowledge.

We employ and compare three distinct methods for automatic information extraction from unstructured text: (1) text mining (2) word embeddings and (3) large language models. These approaches are combined to calculate a hepatotoxicity score for over 50,000 compounds. We assess the performance of the different methods with a use case on Drug-Induced Liver Injury (DILI).

We evaluated hepatotoxicity for over 50,000 compounds and calculated a hepatotoxicity score for each compound. Our results indicate that text mining is effective for this purpose, achieving an Area Under the Curve (AUC) of 0.8 in DILI validation. Large language models performed even better, with an AUC of 0.85, thanks to their ability to interpret the semantic context accurately. Combining these methods further improved performance, yielding an AUC of 0.87 in DILI validation. All findings are available for download to support further research on toxicity assessment.

We demonstrated that automated text mining is able to successfully assess the toxicity of compounds. A text mining approach seems to be superior to word embeddings. However, the application of a large language model with prompt engineering showed the best performance.

## Linked entities

- **Diseases:** Drug-Induced Liver Injury (MONDO:0005359)

## Full-text entities

- **Diseases:** DILI (MESH:D056486), toxicity (MESH:D064420)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12338115/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12338115/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/PMC12338115/full.md

---
Source: https://tomesphere.com/paper/PMC12338115