# A new approach to hierarchical data analysis: Targeted maximum   likelihood estimation for the causal effect of a cluster-level exposure

**Authors:** Laura B. Balzer, Wenjing Zheng, Mark J. van der Laan, Maya L. Petersen, (for the SEARCH Collaboration)

arXiv: 1706.02675 · 2021-07-08

## TL;DR

This paper introduces two targeted maximum likelihood estimators for assessing the causal effect of cluster-level exposures, accommodating complex within-cluster interactions and confounding, with applications to public health interventions.

## Contribution

It develops flexible TMLE methods for hierarchical data, addressing interactions and confounding, and compares their performance through simulations and real-world HIV data analysis.

## Key findings

- The sub-model TMLE can be biased in observational studies.
- Pairing individual risk factors improves estimation accuracy.
- Assumptions during estimation can be more robust than in the causal model.

## Abstract

We often seek to estimate the impact of an exposure naturally occurring or randomly assigned at the cluster-level. For example, the literature on neighborhood determinants of health continues to grow. Likewise, community randomized trials are applied to learn about real-world implementation, sustainability, and population effects of interventions with proven individual-level efficacy. In these settings, individual-level outcomes are correlated due to shared cluster-level factors, including the exposure, as well as social or biological interactions between individuals. To flexibly and efficiently estimate the effect of a cluster-level exposure, we present two targeted maximum likelihood estimators (TMLEs). The first TMLE is developed under a non-parametric causal model, which allows for arbitrary interactions between individuals within a cluster. These interactions include direct transmission of the outcome (i.e. contagion) and influence of one individual's covariates on another's outcome (i.e. covariate interference). The second TMLE is developed under a causal sub-model assuming the cluster-level and individual-specific covariates are sufficient to control for confounding. Simulations compare the alternative estimators and illustrate the potential gains from pairing individual-level risk factors and outcomes during estimation, while avoiding unwarranted assumptions. Our results suggest that estimation under the sub-model can result in bias and misleading inference in an observational setting. Incorporating working assumptions during estimation is more robust than assuming they hold in the underlying causal model. We illustrate our approach with an application to HIV prevention and treatment.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.02675/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1706.02675/full.md

## References

85 references — full list in the complete paper: https://tomesphere.com/paper/1706.02675/full.md

---
Source: https://tomesphere.com/paper/1706.02675