# German Million Children Cohort: a historical birth cohort based on claims data to investigate the impact of immunisation and other early life factors on the risk of cancer and other diseases in childhood – cohort profile

**Authors:** Lara Kim Brackmann, Loviisa Mulanje, Sophie Langbein, Bianca Kollhorst, Ulrike Haug, Wolfgang Ahrens, Rajini Nagrani, Manuela Marron

PMC · DOI: 10.1136/bmjopen-2025-113411 · BMJ Open · 2026-03-26

## TL;DR

This study uses a large historical dataset of children in Germany to explore how early life factors like vaccinations affect childhood cancer and other diseases.

## Contribution

The German Million Children Cohort is a new historical register-based cohort using claims data to study the long-term health effects of early life factors.

## Key findings

- The cohort includes 2,023,613 children with 3,410 childhood cancer cases identified.
- 38.9% of children received all recommended vaccines by age 30 months.
- The cohort allows for future studies on the impact of the COVID-19 pandemic on child health.

## Abstract

As a historical register-based birth cohort, the German Million Children Cohort was set up using data from the German Pharmacoepidemiological Research Database (GePaRD), a health insurance database covering ~20% of the German population. The first cohort project aims to investigate the impact of Prenatal and Childhood Immunization and the Risk of Childhood Cancer project.

The cohort includes all newborns in GePaRD from 2004 to 2018 and is followed until occurrence of childhood cancer, death, the end of continuous insurance or currently available data years (31 December 2022). Information on vaccinations and confounders is collected from the inpatient and outpatient settings. For most children, information on vaccination exposure during pregnancy can be obtained through mother-child linkage.

The cohort includes 2 023 613 children with a maximum follow-up of 18 years. This results in 17 752 995 person-years at risk and 3410 cases of childhood cancer. Maternal linkage was possible for 78.3% of children. The vaccination coverage for both nationally recommended and other vaccines was assessed in the cohort. By age 30 months, 38.9% of children received all recommended vaccines, 52.4% received incomplete vaccinations and 8.8% remained unvaccinated. Regarding further characteristics, 57.6% of children were from families with higher education, 20.3% were hospitalised due to a severe infection during follow-up and 74.5% received at least one antibiotic prescription.

While our first study with this historical cohort is dedicated to the question of whether immunity to childhood infections that are covered by the recommended baseline vaccination scheme reduces the risk of childhood cancer, the cohort will offer further opportunities. This includes the evaluation of additional factors potentially influencing the risk of cancer and other diseases in childhood, as well as an extension of the inclusion period beyond 2018 to also include children born during the COVID-19 pandemic.

## Linked entities

- **Diseases:** childhood cancer (MONDO:0006517)

## Full-text entities

- **Diseases:** pertussis (MESH:D014917), CNS tumours (MESH:D016543), infertility (MESH:D007246), neurocognitive impairment (MESH:D019965), systemic sclerosis (MESH:D012595), purpura anaphylactoides (MESH:D011695), cardiovascular or pulmonary diseases (MESH:D002318), ataxia-telangiectasia (MESH:D001260), Noonan syndrome (MESH:D009634), Behcet's disease (MESH:D001528), Werner syndrome (MESH:D014898), Wegener's granulomatosis (MESH:D014890), juvenile arthritis (MESH:D001171), AML (MESH:D015470), rotavirus (MESH:D012400), death (MESH:D003643), rubella (MESH:D012409), non-neuropathic hereditary familial amyloidosis (MESH:D028227), microscopic polyangiitis (MESH:D055953), Bloom syndrome (MESH:D001816), hepatitis B (MESH:D006509), infection (MESH:D007239), diphtheria (MESH:D004165), Fanconi anaemia (MESH:D000743), tetanus (MESH:D013746), COVID-19 (MESH:D000086382), tuberous sclerosis (MESH:D014402), dermato-/polymyositis (MESH:D017285), poliomyelitis (MESH:D011051), trisomy 21 (MESH:D004314), Beckwith-Wiedemann syndrome (MESH:D001506), CML (MESH:D015464), Panarteritis nodosa (MESH:D010488), Kawasaki disease (MESH:D009080), varicella (MESH:D002644), Leukaemia (MESH:D015458), Nijmegen breakage syndrome (MESH:D049932), Malignant brain tumours (MESH:D001932), lymphomas (MESH:D008223), Takayasu syndrome (MESH:D013625), Crohn's disease (MESH:D003424), measles (MESH:D008457), Cancer (MESH:D009369), mumps (MESH:D009107), ulcerative colitis (MESH:D003093), chronic myeloid leukaemia (MESH:D015451), ALL (MESH:D054218), H. influenzae type b (MESH:D006192), lupus erythematosus (MESH:D008180)
- **Chemicals:** benzene (MESH:D001554)
- **Species:** Homo sapiens (human, species) [taxon 9606], Rotavirus (genus) [taxon 10912]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13034273/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13034273/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC13034273/full.md

---
Source: https://tomesphere.com/paper/PMC13034273