# Read-Across Structural Analysis of PFAS Acute Oral Toxicity in Rats Powered by the Isalos Analytics Platform’s Automated Machine Learning

**Authors:** Aikaterini Theodori, Konstantinos D. Papavasileiou, Andreas Tsoumanis, Georgia Melagraki, Antreas Afantitis

PMC · DOI: 10.3390/toxics14020152 · 2026-02-03

## TL;DR

This paper presents a machine learning model to predict PFAS toxicity in rats and identifies structural features linked to high toxicity.

## Contribution

A novel automated machine learning framework for PFAS toxicity prediction and a freely accessible web application for high-throughput screening.

## Key findings

- A k-nearest neighbours model achieved 81.5% accuracy in predicting PFAS acute oral toxicity.
- Polyaromatic and heterocyclic structures are consistently associated with high toxicity.
- The model is available via the INSIGHT RatTox web application for public use.

## Abstract

The ubiquity and environmental persistence of per- and polyfluoroalkyl substances (PFASs) have raised significant concerns about their detrimental effects on human health. Collective scientific efforts are increasingly focused on elucidating PFAS toxicity mechanisms and identifying potential low-impact PFAS structures that retain the exceptional properties of this chemical class. To advance the use of in silico methods in PFAS toxicity assessment, we developed a robust modelling framework for predicting PFAS acute oral toxicity class (high or low) in rats, leveraging the enhanced capabilities of the in-house Isalos Analytics Platform. The automated machine learning (autoML) functionality was employed to optimise four ML models—k-nearest neighbours (kNN), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), and fully connected neural network (NN)—using Mold2 molecular descriptors, and to identify the top-performing model through five-fold cross-validation. The selected kNN model (k = 3) was used for predictions on the held-out testing set, achieving an accuracy of 81.5%, while a Shapley values analysis provided valuable insights into the factors influencing toxicity predictions. Furthermore, the nearest-neighbour-based methodology enabled a read-across structural analysis of PFAS similarity groups consisting of each testing set instance and its three closest neighbours in the training set. This analysis revealed a consistent association between polyaromatic and heterocyclic structural features and high acute oral toxicity. The developed, thoroughly validated read-across model is freely accessible through the INSIGHT RatTox web application as well as the INSIGHT Cheminformatics Platform in Enalos Cloud, supporting high-throughput screening of PFAS compounds and investigation of structural similarities with their nearest neighbours for enriched structural interpretation.

## Linked entities

- **Species:** Rattus norvegicus (taxon 10116)

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, PFAS (phosphoribosylformylglycinamidine synthase) [NCBI Gene 5198] {aka FGAMS, FGAR-AT, FGARAT, GATD8, PURL}, PPARG (peroxisome proliferator activated receptor gamma) [NCBI Gene 5468] {aka CIMT1, FPLD3, GLM1, NR1C3, PPARG1, PPARG2}, PFAS [NCBI Gene 107099984], Pfas (phosphoribosylformylglycinamidine synthase) [NCBI Gene 287420], PPARD (peroxisome proliferator activated receptor delta) [NCBI Gene 5467] {aka FAAR, NR1C2, NUC1, NUCI, NUCII, PPARB}
- **Diseases:** Oral Toxicity (MESH:D064420), liver and pericardial damage (MESH:D056486), thyroid hormone dysregulation (MESH:D018382), injury to (MESH:D014947), neurotoxic (MESH:D020258), inhalation (MESH:D015208), Acute Toxicity (MESH:D000208)
- **Chemicals:** CAS 363-97-3 (-), amine (MESH:D000588), Cl (MESH:D002713), oil (MESH:D009821), urea (MESH:D014508), sulfonate (MESH:D000476), ROS (MESH:D017382), halogen (MESH:D006219), alcohols (MESH:D000438), PFOS (MESH:C076994), hydrogen (MESH:D006859), salts (MESH:D012492), metal (MESH:D008670), triglyceride (MESH:D014280), carbon (MESH:D002244), nitrogen (MESH:D009584), carboxylic acid (MESH:D002264), imidazole (MESH:C029899), water (MESH:D014867), benzene (MESH:D001554), fluorine (MESH:D005461), per- and polyfluoroalkyl substances (MESH:D005466), PFOA (MESH:C023036), benzimidazole (MESH:C031000)
- **Species:** Homo sapiens (human, species) [taxon 9606], Rattus norvegicus (brown rat, species) [taxon 10116], Cyprinodon variegatus (sheepshead minnow, species) [taxon 28743], Mus musculus (house mouse, species) [taxon 10090]

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944375/full.md

---
Source: https://tomesphere.com/paper/PMC12944375