# Bayesian Additive Regression Trees for Group Testing Data

**Authors:** Madeleine E. St. Ville, Christopher S. McMahan, Joe D. Bible, Joshua M. Tebbs, Christopher R. Bilder

PMC · DOI: 10.1002/sim.70052 · Statistics in Medicine · 2025-03-14

## TL;DR

This paper introduces a flexible Bayesian method to estimate disease probabilities using group testing data, even when test results may be inaccurate.

## Contribution

The novelty is using Bayesian additive regression trees to model unknown covariate effects in group testing without assuming a fixed functional form.

## Key findings

- The proposed method can estimate individual disease probabilities without knowing true statuses.
- It handles misclassification and works with any group testing protocol.
- The approach avoids model misspecification by allowing flexible covariate effects.

## Abstract

When screening for low‐prevalence diseases, pooling specimens (e.g., blood, urine, swabs, etc.) through group testing has the potential to substantially reduce costs when compared to testing specimens individually. A common goal in group testing applications is to estimate the relationship between an individual's true disease status and their individual‐level covariate information. However, estimating such a relationship is a non‐trivial problem because true individual disease statuses are unknown due to the group testing protocol and the possibility of imperfect testing. While several regression methods have been developed in recent years to accommodate the complexity of group testing data, the functional form of covariate effects is typically assumed to be known. To avoid model misspecification and to provide a more flexible approach, we propose a Bayesian additive regression trees framework to model the individual‐level probability of disease with potentially misclassified group testing data. Our methods can be used to analyze data arising from any group testing protocol with the goal of estimating unknown functions of covariates and assay classification accuracy probabilities.

## Full-text entities

- **Diseases:** STD (MESH:D012749), BART (MESH:C537770), HIV (MESH:D015658), chlamydia (MESH:D002690), SARS-CoV-2 (MESH:D000086382), influenza A/B (MESH:D007251), infectious disease (MESH:D003141), infection (MESH:D007239), Zika (MESH:D000071243), gonorrhea (MESH:D006069), chlamydial (MESH:D061387), tuberculosis (MESH:D014376)
- **Chemicals:** MPT (-)
- **Species:** Chlamydia (genus) [taxon 810]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11907685/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11907685/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC11907685/full.md

---
Source: https://tomesphere.com/paper/PMC11907685