# The Sickle Africa Data Coordinating Centre (SADaCC): a data science hub for interdisciplinary sickle cell disease research and training

**Authors:** Ambroise Wonkam, Nchangwi Syntia Munung, Mario Jonas, Wilson Mupfurirwa, Arthemon Nguweneza, Kevin Esoh, Chandre Oosterwyk-Liu, Zimkita Magangana, Khuthala Mnika, Valentina Ngo Bitoungui, Martha Kamkuemah, Kambe Banda, Nabeelah Samie, Jade Hotchkis, Victoria Nembaware, Andre-Pascal Kengne, Nicola Mulder

PMC · DOI: 10.1093/database/baag007 · Database: The Journal of Biological Databases and Curation · 2026-02-20

## TL;DR

SADaCC is the data science hub of the SickleInAfrica consortium. It coordinates sickle cell disease data from over 40,000 patients in seven African countries to support research, training, and improved treatment.

## Contribution

SADaCC creates a standardized, FAIR-compliant data infrastructure for sickle cell disease research across multiple African countries.

## Key findings

- SADaCC coordinates data from over 40,000 SCD patients across seven African countries.
- The registry is built on FAIR-compliant architecture and the Sickle Cell Disease Ontology, and runs on digital platforms such as REDCap, NextCloud, RStudio, GitHub, Docker, and Jupyter.
- SADaCC supports training and research in genomics, bioethics, and AI-driven analytics for SCD.

## Abstract

Sickle cell disease (SCD) is one of the most prevalent monogenic disorders worldwide, with the highest burden in Africa, where ~75% of the 7.74 million global cases occur. Scientific progress in understanding its epidemiology, clinical heterogeneity, and treatment outcomes has been constrained by heterogeneous, non-standardized, and non-interoperable datasets that limit data integration and cross-country analyses. To address this, the Sickle Africa Data Coordinating Centre (SADaCC) was established as the data science hub of the SickleInAfrica consortium to support the development and expansion of a Pan-African SCD registry. SADaCC now coordinates one of the largest patient-consented SCD datasets globally, with data from over 40 000 persons living with SCD in seven countries (Ghana, Mali, Nigeria, Tanzania, Uganda, Zambia, and Zimbabwe) within the Sickle Pan-African Research Consortium (SPARCo), as well as genomic data from SADaCC satellite sites in Cameroon, South Africa, and Malawi. The registry is built on FAIR-compliant architecture and the Sickle Cell Disease Ontology, and is powered by a suite of digital platforms such as REDCap, NextCloud, RStudio, GitHub, Docker, and Jupyter. In partnership with SPARCo, SADaCC is also piloting a biobank that will link biospecimens with data in the registry to advance multi-omics research. Beyond infrastructure, SADaCC leads training and/or research in big data analytics, genomics, bioethics, implementation science, qualitative research, and psychosocial studies. Ethical, legal, and social considerations are embedded across all operations, with emphasis on equitable intra-African collaboration and patient involvement in research. Looking ahead, SADaCC will integrate real-time data streams, AI-driven analytics, and multi-omics data to drive big data and genetic medicine research for SCD in Africa.
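
The abstract attributes limited cross-country analysis to non-standardized, non-interoperable datasets. The following minimal Python sketch illustrates that general problem and one common remedy: renaming site-specific fields to a shared set of data elements before pooling. It is not SADaCC's implementation. The site labels, field names, mappings, and records are all hypothetical and synthetic, and none are taken from the paper, the registry, or the Sickle Cell Disease Ontology.

```python
from collections import Counter

# ASSUMPTION: hypothetical per-site field names mapped to hypothetical shared
# element names. These are illustrative only, not the registry's real schema.
SITE_FIELD_MAPS = {
    "site_a": {"pid": "patient_id", "genotype": "scd_genotype", "country": "country"},
    "site_b": {"PatientID": "patient_id", "hb_type": "scd_genotype", "Country": "country"},
}


def harmonize(site, record):
    """Rename one site's fields to the shared names; unmapped fields are dropped."""
    mapping = SITE_FIELD_MAPS[site]
    return {shared: record[local] for local, shared in mapping.items() if local in record}


def pool(records_by_site):
    """Harmonize every record from every site and return one combined list."""
    pooled = []
    for site, records in records_by_site.items():
        pooled.extend(harmonize(site, r) for r in records)
    return pooled


# Synthetic example records (not real patient data).
raw = {
    "site_a": [{"pid": "A-001", "genotype": "HbSS", "country": "Ghana"}],
    "site_b": [{"PatientID": "B-001", "hb_type": "HbSS", "Country": "Mali", "ward": "3"}],
}

pooled = pool(raw)
# Once fields share names, a cross-site summary is a single expression.
counts = Counter(r["country"] for r in pooled)
```

Without the mapping step, the two synthetic sites could not be counted together, because the same concept is stored under different keys (`pid` versus `PatientID`, `genotype` versus `hb_type`).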

## Linked entities

- **Diseases:** sickle cell disease (MONDO:0011382)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Genes:** HBS1L (HBS1 like translational GTPase) [NCBI Gene 10767] {aka EF-1a, ERFS, HBS1, HSPC276, eRF3c}, BCL11A (BCL11 transcription factor A) [NCBI Gene 53335] {aka CTIP1, DILOS, EVI9, HBFQTL5, SMARCM1, ZNF856}, MYB (MYB proto-oncogene, transcription factor) [NCBI Gene 4602] {aka Cmyb, c-myb, c-myb_CDS, efg}
- **Diseases:** beta-thalassemia (MESH:D017086), hypoxic (MESH:D002534), monogenic disorders (MESH:D009358), haemolysis (MESH:D006461), genetic diseases (MESH:D030342), vaso-occlusive crises (MESH:D013224), pain crises (MESH:D010146), Disease (MESH:D004194), HbSS (MESH:D000090542), haemolytic anaemia (MESH:D000743), HbS (MESH:D000755), multi-organ damage (MESH:D000092124), malaria (MESH:D008288), abnormal haemoglobin (MESH:D000014)
- **Chemicals:** hydroxyurea (MESH:D006918)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** Glu6Val

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12923167/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12923167/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC12923167/full.md

---
Source: https://tomesphere.com/paper/PMC12923167