# Integration of HIV Status in Cancer Surveillance in South Africa: A Call for Action

**Authors:** Carole Metekoua, Tracey Wiggill, Tinashe Tombe‐Nyahuma, Yann Ruffieux, Judith Mwansa‐Kambafwile, Stanford Kwenda, Tafadzwa G. Dhokotera, Julia Bohlius, Eliane Rohner, Mazvita Muchengeti

PMC · DOI: 10.1002/cam4.71661 · Cancer Medicine · 2026-02-26

## TL;DR

This paper highlights the importance of tracking HIV status in cancer patients in South Africa to improve care and cancer control.

## Contribution

The study integrates HIV status into cancer surveillance using natural language processing and record linkage in South Africa.

## Key findings

- HIV status documentation in cancer patients increased from 29% to 52% between 2004–2009 and 2016–2021.
- Infection-related cancers had a 75% HIV prevalence compared to 32% in unrelated cancers.
- Documentation rates were higher in females and lower in non-Black African population groups.

## Abstract

Human immunodeficiency virus (HIV) increases the risk of developing cancer. We aimed to assign HIV status to cancers diagnosed in public laboratories recorded in the National Cancer Registry (NCR) in South Africa, guided by HIV counselling and testing guidelines.

We used natural language processing to extract HIV‐related information from free‐text reports and probabilistic record linkage to match cancers diagnosed between 2004 and 2021 to HIV‐related tests from the National Health Laboratory Service Corporate Data Warehouse. We assigned HIV status based on the results of the HIV‐related tests and their timing relative to cancer diagnosis. We used descriptive statistics and logistic regression to examine HIV status documentation patterns and HIV prevalence in cancer patients.

Of the 496,517 cancers reported to the NCR, 41% (n = 203,937) had a documented HIV status. Documentation increased from 29% in 2004–2009 to 52% in 2016–2021. The odds of having a documented HIV status were 20% higher in females than in males and 16%–28% lower in other population groups compared with Black Africans. Patients with infection‐related cancers had almost threefold higher odds of having a documented HIV status than patients with infection‐unrelated cancers. Among cancer patients with documented HIV status, HIV prevalence was 75% for infection‐related and 32% for infection‐unrelated cancers.

HIV status documentation among people with cancer has improved over time, but it is still suboptimal. Clinicians and pathologists in HIV endemic areas need to improve HIV ascertainment at cancer diagnosis and reporting to cancer registries to inform patient care and guide cancer control efforts.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Genes:** CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, PRL (prolactin) [NCBI Gene 5617] {aka GHA1, pPRL}
- **Diseases:** cytotoxicity (MESH:D064420), infection (MESH:D007239), sexually transmitted infections (MESH:D012749), Cervix (MESH:D002577), Drug Abuse (MESH:D019966), Uterus (MESH:D014594), AIDS-defining cancers (MESH:D009369), Diabetes (MESH:D003920), Lung, (MESH:D008171), immune-deficiency (MESH:D007154), Breast (MESH:D061325), carcinogenicity (MESH:D011230), cervical cancer (MESH:D002583), Alcohol Abuse (MESH:D000437), Colorectal, (MESH:D015179), chronic inflammation (MESH:D007249), oesophageal and prostate cancer (MESH:D011471), Melanoma (MESH:D008545), Kaposi sarcoma (MESH:D012514), Prostate (MESH:D011472), Allergy and Infectious Diseases (MESH:D003141), HPV- (MESH:D030361), HIV (MESH:D015658), Bladder (MESH:D001745), Hodgkin lymphoma (MESH:D006689), tuberculosis (MESH:D014376), non-Hodgkin lymphoma (MESH:D008228), AIDS (MESH:D000163), Non-Hodkgin lymphoma (MESH:D008223), Stomach (MESH:D013272), vulvar cancer (MESH:D014846), Digestive and Kidney Diseases (MESH:D007674)
- **Species:** Human papillomavirus (species) [taxon 10566], Homo sapiens (human, species) [taxon 9606], Human immunodeficiency virus (species) [taxon 12721], Human immunodeficiency virus 1 (no rank) [taxon 11676]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12945709/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12945709/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/PMC12945709/full.md

---
Source: https://tomesphere.com/paper/PMC12945709