# Perspectives on variability of in vivo toxicology studies: considerations for next-generation toxicology

**Authors:** Agnes L. Karmaus, Anna L. Kreutz, Oluwakemi Oyetade, Katie Paul Friedman, Martin Paparella, Emily N. Reinke, David Allen, Helena T. Hogberg, Nicole C. Kleinstreuer

PMC · DOI: 10.3389/ftox.2026.1778353 · Frontiers in Toxicology · 2026-03-02

## TL;DR

This paper reviews variability in animal toxicology studies and discusses how understanding this variability can improve new testing methods and risk assessments.

## Contribution

The paper provides a comprehensive review of variability in in vivo toxicology studies and emphasizes its importance for evaluating new approach methodologies.

## Key findings

- Variability in in vivo toxicology studies varies widely by study type and endpoint complexity.
- Understanding variability sources helps in setting baselines for new testing methods.
- Robust understanding of traditional study performance is crucial for developing species-relevant NAMs.

## Abstract

Animal studies have historically informed toxicological testing and safety assessments. However, assessment of the variability in both quantitative and qualitative results has been limited. Biological variability, experimental differences, interpretation of categorical endpoints, and data availability and curation approaches all contribute to the quantified variability.

A literature review was conducted to identify publications describing variability analyses for in vivo toxicology studies. Variability analyses were evaluated and summarized for a variety of toxicological endpoints: ocular irritation, dermal sensitization and irritation, acute oral and inhalation lethality, subchronic and chronic toxicity, carcinogenicity, neurotoxicity including DNT, endocrine, and genotoxicity.

This review summarizes published investigations of variability within mammalian toxicological studies that have been largely conducted in accordance with health effects test guidelines. The results of this review suggest that replicability of in vivo toxicological guideline studies varies widely by study type, endpoint complexity, and classification approach.

While any test system will have inherent variability, understanding its sources and impact on study interpretation will help ensure that appropriate confidence is applied when using the test method. Furthermore, such information aids in establishing relevant metrics to serve as baselines for informing performance characterization of new approach methodologies (NAMs). Future evaluation of NAMs should be contextualized using estimates of uncertainty and variance of the traditional study data to demonstrate “better” performance compared to traditional testing approaches. Robust understanding of guideline study performance is important for risk assessments, where it is important to find species-relevant NAMs that can perform at least as well as existing bioassays.

## Full-text entities

- **Diseases:** toxicity (MESH:D064420), neurotoxicity (MESH:D020258), ocular irritation (MESH:D001523), carcinogenicity (MESH:D011230)
- **Chemicals:** DNT (MESH:C023514)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12989281/full.md

## References

62 references — full list in the complete paper: https://tomesphere.com/paper/PMC12989281/full.md

---
Source: https://tomesphere.com/paper/PMC12989281