# Two-stage cluster sampling to assess SARS-CoV-2 seroprevalence without pre-enumeration: An example from Madagascar

**Authors:** Eva Lorenz, John Amuasi, Tiana Randrianarisoa, Tahinamandranto Rasamoelina, Leonard Gunga, Dominik Benke, Jonathan Ströbele, Jenny Kettenbeil, Wibke Loag, Haja Andriamahandry, Landry Razanakolona, Jean Rolland Randrianirina, Hosea Randrianasolo, Jean Christian Ratombotsoa, Fitahina Nahita, Daniel Eibach, Daniela Fusco, Jürgen May, Rivo A. Rakotoarivelo, Aurélia Souares, Emmanuel Bonnet, Nicole S. Struck

PMC · DOI: 10.1371/journal.pone.0334627 · 2025-11-04

## TL;DR

Researchers in Madagascar developed a new survey method using GPS and mapping to study SARS-CoV-2 without needing detailed population lists.

## Contribution

A novel two-stage cluster sampling method using field mapping and GPS to conduct surveys without pre-enumeration.

## Key findings

- 95.3% of randomly generated GPS coordinates successfully located households.
- Participation rate was 96.8% among contacted households.
- The method produced a representative sample matching census data.

## Abstract

Implementing population-based surveys in resource-constrained settings presents logistical challenges when detailed population enumeration is unavailable. We developed a field mapping system integrated into a cluster sampling approach to eliminate pre-enumeration requirements for a SARS-CoV-2 seroprevalence survey in Madagascar. We conducted a cross-sectional observational study in urban Fianarantsoa, Madagascar, between February and June 2021. Using probability proportional to size sampling, we selected clusters from administrative areas (fokontany) and randomly generated GPS coordinates within these clusters. Field teams navigated to coordinates using OpenStreetMap software on tablets, identified eligible households, and conducted health surveys with blood sampling. We employed a mobile-compatible system for real-time household mapping and data collection, functioning without continuous network connectivity. Sample size calculation targeted 650 households (SARS-CoV-2 seroprevalence 30%, precision ±5%, design effect 2.0). Our specific objectives were to develop and implement a geographic cluster sampling method that did not require pre-enumeration; to assess the feasibility of this method through participation rates; and to evaluate potential selection biases related to socioeconomic factors. We identified households at 95.3% (696/730) of randomly generated GPS coordinates. Of contacted households, 96.8% (674/696) participated, representing 1,121 individuals across 57 clusters. Participation rates varied geographically, with a modest inverse correlation with household wealth (participation decreased by 0.85% per wealth quintile increase, 95% CI: −3.54% to 1.84%). Demographic characteristics of our sample matched census data for urban Fianarantsoa, supporting the representativeness of our approach. This integrated field mapping system created a virtual household map simultaneously with survey implementation, enabling cost-effective two-stage cluster sampling without pre-enumeration. The approach enabled evaluation of selection bias, simplified logistics, and provided a permanent geo-referenced database of surveyed households. This methodology offers a practical solution for population-based surveys in resource-constrained settings with incomplete enumeration data and has applications beyond COVID-19 research for various public health surveillance activities.

## Linked entities

- **Diseases:** SARS-CoV-2 (MONDO:0100096)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** COVID-19 (MESH:D000086382)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12585014/full.md

---
Source: https://tomesphere.com/paper/PMC12585014