# Automating incidence and prevalence analysis in open cohorts

**Authors:** Neil Cockburn, Ben Hammond, Illin Gani, Samuel Cusworth, Aditya Acharya, Krishna Gokhale, Rasiah Thayakaran, Francesca Crowe, Sonica Minhas, William Parry Smith, Beck Taylor, Krishnarajah Nirantharakumar, Joht Singh Chandan

PMC · DOI: 10.1186/s12874-024-02266-7 · BMC Medical Research Methodology · 2024-07-04

## TL;DR

This paper introduces automated methods and a Python tool to calculate incidence and prevalence in open cohort datasets, improving transparency and efficiency in public health research.

## Contribution

The novel contribution is a rule-based framework and a Python CLI tool for automating incidence and prevalence analysis in open cohorts.

## Key findings

- A code-free ruleset for incidence and prevalence analysis in open cohorts was developed.
- A Python CLI implementation was created to compute incidence and point prevalence time series.
- The ruleset can be adapted for other analytical questions like period prevalence.

## Abstract

Data is increasingly used for improvement and research in public health, especially administrative data such as that collected in electronic health records. Patients enter and exit these typically open-cohort datasets non-uniformly; this can render simple questions about incidence and prevalence time-consuming and with unnecessary variation between analyses. We therefore developed methods to automate analysis of incidence and prevalence in open cohort datasets, to improve transparency, productivity and reproducibility of analyses.

We provide both a code-free set of rules for incidence and prevalence that can be applied to any open cohort, and a python Command Line Interface implementation of these rules requiring python 3.9 or later.

The Command Line Interface is used to calculate incidence and point prevalence time series from open cohort data. The ruleset can be used in developing other implementations or can be rearranged to form other analytical questions such as period prevalence.

The command line interface is freely available from https://github.com/THINKINGGroup/analogy_publication.

The online version contains supplementary material available at 10.1186/s12874-024-02266-7.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11223317/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11223317/full.md

## References

20 references — full list in the complete paper: https://tomesphere.com/paper/PMC11223317/full.md

---
Source: https://tomesphere.com/paper/PMC11223317