HPV profiles in Botswana: An analysis of healthy women, cervical intraepithelial neoplasia, and invasive cervical cancer
Caroline Kernell, Emily MacDuffie, Xiang Lin, Le Gao, Doreen Ramogola-Masire, Surbhi Grover, Erle Robertson

TL;DR
The study analyzes HPV profiles in Botswana women, showing higher prevalence of high-risk HPV in cervical intraepithelial neoplasia and invasive cervical cancer compared to healthy women.
Contribution
The study identifies specific high-risk HPV subtypes associated with progression from pre-cancerous lesions to invasive cervical cancer in Botswana.
Findings
High-risk HPV prevalence increases from healthy women to those with cervical intraepithelial neoplasia and invasive cervical cancer.
HPV subtypes like HPV 16 and 18 are more prevalent in invasive cervical cancer compared to cervical intraepithelial neoplasia.
Multiple high-risk HPV strains may contribute to the progression from pre-cancerous lesions to cervical cancer.
Abstract
•Widespread HIV and HPV in Sub-Saharan Africa contributes to the growing burden of cervical cancer.•Compared to healthy students, women with CIN or invasive cervical cancer have a higher prevalence of high-risk (HR) HPV.•Compared to women with CIN, women with invasive cervical cancer have higher prevalence of HR HPV.•Cellular HPV viral burden remains unchanged between healthy students, women with CIN, and women with cervical cancer.•Regardless of viral burden, further acquisition of HR HPV in those with CIN is important in progression to cervical cancer. Widespread HIV and HPV in Sub-Saharan Africa contributes to the growing burden of cervical cancer. Compared to healthy students, women with CIN or invasive cervical cancer have a higher prevalence of high-risk (HR) HPV. Compared to women with CIN, women with invasive cervical cancer have higher prevalence of HR HPV. Cellular HPV…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCervical Cancer and HPV Research · Hepatitis B Virus Studies
Introduction
1
Human papilloma virus (HPV) is a sexually transmitted infection that causes cervical cancer (Zur Hausen, 1994, Bosch et al., 1995, Szymonowicz and Chen, 2020). Most cervical cancers are associated with HPV subtypes 16 and 18, however in women with human immunodeficiency virus (HIV) infection, a larger proportion are associated with other high-risk subtypes. Co-infection of HIV with HPV is associated with a 6-fold increase in cervical cancer risk (Stelzle et al., 2021). Because the burden of HIV infection is concentrated in low- and middle-income countries (LMICs), understanding HPV pathogenesis in high-HIV settings is critical to address the global cervical cancer burden. A majority of cervical cancer deaths occur in LMICs in Sub-Saharan Africa (SSA) (Bray et al., 2018). Botswana, a middle-income country of 2.6 million people, has a high cervical cancer incidence, driven by its high HIV infection rate of 20 % (Botswana U, xxxx, xxxx). Since the implementation of free antiretroviral therapy (ART) for citizens in 2002, 84 % of patients living with HIV have access to ART and viral suppression is estimated to be 96 % (Ramogola-Masire et al., 2020). With recent expansion in HPV vaccination and screening programs, there were 57 VIA clinics, 45 cryotherapy clinics, and 35 colposcopy LEEP clinics in Botswana as of 2021 (Luckett et al., 2023). For locally-advanced cervical cancer patients, a gynecological multi-disciplinary team clinic (MDT) was established in 2015 to streamline care and reduce treatment delays for women with gynecological malignancies (Grover et al., 2023, Grover et al., 2017). Despite these recent interventions, cervical cancer remains the leading cause of cancer-related death in Batswana women (Grover et al., 2015).
Studies from SSA examining HPV prevalence in cervical cancer patients have primarily focused on high-risk subtypes and lacked quantitative data beyond presence or absence (Dols et al., 2012, Mpunga et al., 2020, Menon et al., 2016, Okoye et al., 2021). Similarly in Botswana, the prevalence of high-risk subtypes have been described in several different demographic groups. A cross-sectional study reported over 60 % of unvaccinated university-age women carried HPV, with 16 % carrying HPV 16 or 18, and 44 % carrying a non-high risk subtype (Ramatlho et al., 2022). Among patients in Botswana with invasive cervical cancer, forty HPV subtypes have been identified, with the most prominent subtypes being high-risk subtypes HPV 34 and 26 (Grover et al., 2023). These studies have notably been limited in their ability to quantify the presence of individual subtypes, hindering their ability to compare the distribution of viral burden in different populations.
This study recorded the prevalence and quantified the burden of 5 high-risk and 35 low-risk HPV subtypes among three distinct demographic cohorts of women in Botswana to better understand the shifts in HPV pathogenesis that may impact subsequent development of cervical dysplasia and invasive cancer. These three cohorts included healthy, HPV-unvaccinated university-aged women, women diagnosed with precancerous cervical intraepithelial neoplasia (CIN), and women diagnosed with invasive cervical cancer. This study used a novel enhanced HPV subtyping technology to quantitatively compare HPV burden and prevalence across these cohorts of women in Botswana. Describing the landscape of HPV subtype distribution in diverse cohorts with empirical data is critical to guiding efforts at HPV prevention with region-appropriate vaccination initiatives.
Materials and methods
2
Cohort enrollment and data collection
2.1
The Ipabalele study and enrolled three cohorts of subjects between 2016–2020 in Botswana (Grover et al., 2019). The FHUS Cohort consisted of female healthy university students (FHUS) over 18 years old. Baseline demographic data, urine sample, HIV test, and baseline cervical swab were collected. Three additional cervical swabs were collected at 3–6 months, 9–12 months, and 16–20 months. The CIN II/III Cohort consisted of women with histologically confirmed high grade CIN (CIN2/3). Baseline demographic data and cervical swab were obtained. The CaCx Cohort consisted of women with histologically confirmed invasive cervical cancer (CaCx) receiving curative-intent treatment. Participants were enrolled at Princess Marina Hospital’s women’s gynecological health clinic as well as at Gaborone Private Hospital. Baseline and follow-up patient and disease characteristics, treatment toxicities, blood, and cervical swab were obtained.
PathoChip
2.2
The PathoChip is a functional microarray assay with probes able to detect both unique and conserved genomes of numerous organisms. The use of PathoChip was previously documented in studies (Banerjee et al., 2015, Baldwin et al., 2014). Using these probes, we tested for high-risk HPV subtypes 16, 18, 26, 34, and 53; and low-risk subtypes 1, 2, 4, 5, 6b, 7, 9, 10, 32, 41, 48, 49, 50, 54, 60, 61, 63, 88, 90, 92, 96, 101, 103, 108, 109, 112, 116, 121, 126, 128, 129, 131, 132, 134, and 148. HPV 41 was not among the signature probes tested for The CIN II/III Cohort.
Sample storage, preparation, and microarray processing
2.3
Cervical swab samples were stored in phosphate-buffered saline solution at −80 °C until processing. The array design and screening procedure for PathoChip has been previously described in multiple studies (Grover et al., 2023, Banerjee et al., 2015, Baldwin et al., 2014, Ramogola-Masire et al., 2022, Raesima et al., 2015, Banerjee et al., 2018, Banerjee et al., 2017, Banerjee et al., 2017).
Microarray data extraction and statistical analysis
2.4
Data extraction and analysis after detection of signals using Agilent scanner D was previously described. A description of the analysis pipeline was also previously described (Grover et al., 2023, Banerjee et al., 2015, Baldwin et al., 2014, Ramogola-Masire et al., 2022, Raesima et al., 2015).
Clinical data analysis
2.5
Hybridization signal intensity (HSI) and prevalence was determined for each HPV subtype by cohort. Prevalence was determined by tallying samples with signals exceeding the average signal of the human probes (using a cutoff) and expressing it as a percentage of the total sample count. The FHUS Cohort samples from the four timepoints was compared using a Chi-squared test. The CIN II/III Cohort and 3 samples were stratified by HIV status and compared using a Chi-squared test. The distributions of the different HPV subtypes were compared between the three cohorts, severity of cervical dysplasia, and FIGO stage of cervical cancer. All analysis was conducted using R version 4.1.0.
Ethics
2.6
This study was approved by the University of Pennsylvania (IRB # 822837), the Human Research Development Council at the Ministry of Health and Wellness in Botswana, and the University of Botswana institutional review boards.
Results
3
Demographic and clinical characteristics
3.1
A total of 414 patients were included in this study, full demographic characteristics are shown in Table 1. The FHUS Cohort included 43 patients with a median age of 19 (IQR 19–19.5). Women living with HIV (WLWH) accounted for 0 % (n = 0) of the FHUS Cohort. The CIN II/III Cohort included 212 patients with a median age of 39 (IQR 35–45). WLWH accounted for 76 % (n = 161) of the CIN II/III Cohort. The CaCx Cohort included 159 patients with a median age of 46 (IQR 41–53). WLWH accounted for 72 % (n = 115) of the CaCx Cohort.Table 1. Demographic characteristics of Cohorts 1, 2, and 3.TotalCohort 1Cohort 2Cohort 3AllAllHIV +HIV −AllHIV +HIV −NPercentage (%)NPercentage (%)NPercentage (%)NPercentage (%)NPercentage (%)NPercentage (%)NPercentage (%)43100.0212100.0161100.051100.0159100.0115100.044100.0Age (y)</=2043100.000.000.000.000.000.000.020–4000.06430.24930.41529.43421.42824.3613.640–6000.05023.64125.5917.610767.38170.42659.1>6000.020.921.200.01811.365.21227.3Median19394036464552IQR19–19.535–4536–4633–4541–5341–50456–62 Height (cm)</=15012.320.921.200.095.787.012.3150–1601432.65325.04326.71019.65534.64135.71431.8160–1702251.25425.54125.51325.56742.14740.92045.5>170614.073.363.712.0148.887.0613.6Median161.4161.0161.0161.0161.5161.0163.0IQR156.4–166.2157–165157–165158.8–165156.2–167155.9–164.8158.5–167 Weight (kg)</=501841.9125.7116.812.01610.11513.012.350–601227.93114.62515.5611.83220.12723.5511.460–70818.63416.03119.335.93018.92521.7511.4>70511.63918.42515.51427.56742.13833.02965.9Median54.964.062.171.968.964.979.3IQR46.5–6455.9–73.154.2–71.360–78.456.4–79.854.6–74.468.3–88.9 BMIUnderweight (<18.5)920.962.863.700.0106.3108.700.0Healthy weight (18.5–24.9)2762.85325.04427.3917.65534.64640.0920.5Overweight (25.0–29.9)49.33717.52918.0815.73622.62723.5920.5Obese (>/=30)37.0209.4138.1713.73924.51815.72147.7Median20.925.024.528.025.724.731.1IQR18.7–23.821.1–27.920.9–27.023.6–30.922.4–31.121.5–29.225.7–34.0 Age of Sexual Debut (y)</=181739.55325.04125.51223.58452.86052.22454.5>181125.64923.14024.8917.65434.03732.21738.6No sex1534.911051.98049.73058.82113.21815.736.8Median18181818181818IQR18–1917–2017–2018–2017–2017–2017–20 Number of Sexual Partners11637.231.431.900.01710.776.11022.7>11227.99946.77848.42141.211169.88170.43068.201534.911051.98049.73058.82113.21815.736.8Median1553333IQR1–23–73–63–72–52–42–5 Smoking StatusYes614.094.253.147.831.921.712.3No3786.010650.08653.42039.215698.111398.34397.7
High-Risk HPV prevalence and burden
3.2
All five high-risk subtypes tested (16, 18, 26, 34, 53) were detected across the three cohorts. The prevalence of these subtypes rose markedly from healthy university students to women with CIN and then to those with invasive cervical cancer (Fig. 1). In the FHUS Cohort, baseline prevalence of HPV 16 and 18 was low (26 % and 19 %), with HPV 26 being the highest (65 %). Among the high-risk subtypes tested, all showed a statistically significant increase in prevalence from baseline to 16–20 months (p < 0.05), with HPV 26 remaining the most prevalent (88 %) (Supplement Table 1). In contrast to the FHUS Cohort, in the CIN II/III Cohort prevalence of HPV 16, 18, 26, 34, and 53 ranged from 78 %-99 %, and in the CaCx Cohort nearly all women carried HPV 16 (100 %) and HPV 18 (98 %) (Supplement Tables 2-3). Compared with the CIN II/III Cohort, the CaCx Cohort showed significantly higher prevalence of HR HPV subtypes HPV 16 and 18 (p < 0.05), with the CIN II/III Cohort having a higher prevalence of other HR HPV 26 and 34 (p < 0.05) (Fig. 1). When stratified by HIV status, WLWH in the CaCx Cohort had higher prevalence of HPV 16 and 18 than WLWH in the CIN II/III Cohort (p > 0.05), and HIV-negative women in the CaCx Cohort had a higher prevalence of HPV 16 and 18 than those in the CIN II/III Cohort (p < 0.05) (Fig. 2A and B). HIV status among the cohorts themselves by HPV subtype are detailed in Supplement Tables 2-3. Hybridization signal intensity (HSI), a surrogate of viral burden, did not differ significantly among cohorts (Fig. 3) or by HIV status (Fig. 4A and B).Fig. 1. Prevalence of HPV subtypes in Cohorts 1, 2, and 3.Fig. 2. Prevalence of HPV subtypes in Cohorts 1, 2, and 3 by HIV status. 2A: HPV subtypes in WLWH, 2B: HPV subtypes in women without HIV.Fig. 3HSI of HPV subtypes in Cohorts 1, 2, and 3.Fig. 4HSI of HPV subtypes in Cohorts 1, 2, and 3 by HIV status. 4A: HPV subtypes in WLWH, 4B: HPV subtypes in women without HIV.
Low-Risk HPV prevalence and burden
3.3
All thirty-five low-risk HPV subtypes tested were detected across the three cohorts. The prevalence of these subtypes was highest in women with CIN and remained high in those with invasive cervical cancer (Fig. 1). In the FHUS Cohort, the most common LR HPV subtypes at baseline were HPV 61 and 126 (84 % each), followed by a broad range of other low-risk subtypes, many of which increased significantly in prevalence at 16–20 months (p < 0.05) (Supplement Table 1). Between baseline and 16–20 months, most low-risk subtypes showed a new increase in prevalence, though a few (HPV 60, 103, 112, 126, 129, and 148) decreased. In contrast to the FHUS Cohort, the CIN II/III Cohort prevalence of most low-risk subtypes exceeded 85 %, with HPV 6b detected in 100 % of patients and only HPV 90 below 75 % (Supplement Table 2). The CaCx Cohort remained consistent with the CIN II/III Cohort, with 97 % of patients having HPV 6b infection (Supplement Table 3). Compared with the CIN II/III Cohort, the CaCx Cohort showed a significantly higher prevalence of HPV 90 (p < 0.05), with most other LR HPV types being significantly more prevalent in the CIN II/III Cohort (p < 0.05). (Fig. 1). When stratified by HIV status, WLWH in the CIN II/III Cohort had a significantly higher prevalence of low-risk subtypes HPV 48, 49, 60, 109, 121, 134 when compared to the CaCx Cohort (p < 0.05), while HIV-negative women in the CIN II/III Cohort had higher prevalence of numerous low-risk subtypes compared to both Cohorts 1 and 3 (Fig. 2A and B). HIV status among the cohorts themselves by HPV subtype are detailed in Supplement Tables 2-3. HSI did not differ significantly among cohorts (Fig. 3) or by HIV status (Fig. 4A and B).
Discussion
4
This study compared the distribution, prevalence, and burden of HPV subtypes between healthy university students, patients with CIN 2/3, and patients with invasive cervical cancer undergoing curative treatment in Botswana.
HPV subtype distribution across the cervical cancer spectrum
4.1
Our data show a rise in prevalence of HR HPV from healthy university students to women with CIN 2/3 and then to those with cervical cancer, while viral burden (HSI) remained stable across cohorts and HIV status. This finding reinforces the concept that persistent infection with oncogenic HPV subtypes, rather than absolute viral load, drives progression to cervical cancer. This finding may suggest that ART, which has a high level of adherence in Botswana, surpassing 95 % since the inception of a widespread HIV testing and ART program implemented by the government in Botswana in 2002, (Development CfGCfG, 2016) may be playing a protective role in controlling HPV viral load, even though HIV infection remains a recognized risk factor for cervical cancer (Stelzle et al., 2021).
The FHUS Cohort: University students and high-risk HPV acquisition
4.2
The HPV subtype prevalence seen in the FHUS Cohort builds on the results of previous studies of university students in Botswana. A 2022 study investigated for the first time the prevalence of HPV in unvaccinated female university students, finding that 60 % of students were infected with HPV (Ramatlho et al., 2022). Within this population, HPV 26 was the most prevalent, paired with gradual rises in HPV 16 and 18. A mid-study dip followed later by an increase in HPV prevalence suggests both spontaneous clearance and repeated acquisition of infection, as younger women are more likely to clear HPV infection compared to older women (Erickson et al., 2013). These observations further support the continued need for HPV vaccination among this population. Although Botswana initiated the quadrivalent HPV vaccine program for school-going girls ages 9–13 in 2015, uptake was only 47 % as of 2019 (Department SR, 2016). Thus, HPV infection and re-infection remains an issue of high concern for university-age students in Botswana.
Comparison of CIN and invasive cancer
4.3
Women with CIN were infected with a broad spectrum of both HR and LR HPV subtypes, whereas those with invasive cervical cancer demonstrated dominance of HPV 16 and 18. Low-risk subtypes were more prevalent in CIN and less prevalent in invasive disease, supporting the view that malignant transformation is characterized by selective persistence of high-risk HPV subtypes rather than overall diversity of infection. However, there is extreme genetic heterogeneity in the distribution of LR HPV subtypes among the 3 cohorts, suggesting a need for further investigation on the differential impact of LR HPV combinations and their influence on carcinogenesis. These trends were similar in women with and without HIV, indicating that, although WLWH are 6 times more likely to develop cervical cancer, they do not require a lower viral burden of HPV to develop CIN compared to their HIV negative counterparts. This point once again emphasizes the need for HPV prevention and CIN detection, because regardless of HIV status, women are at equal risk for development of these lesions.
Implications for vaccination and screening
4.4
Botswana’s high ART coverage and the success of its HPV vaccination program represent major public health advances, yet a substantial population still remains at risk for HPV exposure. Girls vaccinated in 2015 are now in their mid-20 s, leaving a large population of unvaccinated young adults still vulnerable to HPV and cervical cancer. The subtypes in the FHUS Cohort also suggest the need to shift to the nonavalent vaccine, which covers HPV 6, 11, 16, 18, 31, 33, 45, 52, and 58. Although this study did not examine all subtypes the vaccine covers, the rising prevalence of non-16/18 HR HPV in young women indicates other subtypes, such as HPV 34, are contributing to disease. Importantly, this study shows that acquisition of high-risk HPV in women with CIN, regardless of HIV status or viral burden, is key in progressing to cancer. This supports early interventions, including vaccinations. Strengthening screening programs and ensuring a high uptake for vaccination will be essential to further reduce the cervical cancer burden in Botswana.
Strengths and limitations
4.5
A strength of the study is its use of PathoChip, which detects extremely low viral loads and enables detailed analysis of lesser-known low-risk subtypes. This technology allowed us to capture a broader and more nuanced picture of HPV diversity than is typically possible with standard assays. Additional strengths include the prospective design of the university cohort and the inclusion of both HIV positive and negative patients.
Limitations of this study include potential recall bias with self-reported sexual history, incomplete HPV subtype coverage, and lack of timing data between HIV infection and cervical disease progression. Additionally, since younger women are more likely to clear HPV infections, age differences may possibly account for lower HPV prevalence in the university cohort.
Conclusions
5
This study analyzed a wide array of HPV subtypes and prevalence over three different cohorts in Botswana. HR HPV prevalence in university students increased significantly over 20 months. We have also demonstrated a dramatic increase in HPV burden between young and middle-aged women, with women with cervical cancer having the highest prevalence of HR HPV 16 and 18. Lastly, this study also shows the extreme genetic variability among HPV subtypes in this population, and further supports the need for continued and expanded vaccination and added screening protocols for Botswana.
CRediT authorship contribution statement
Caroline Kernell: Writing – review & editing, Writing – original draft, Visualization. Emily MacDuffie: Writing – review & editing. Xiang Lin: Methodology, Investigation, Formal analysis, Data curation, Conceptualization. Le Gao: Methodology, Investigation, Formal analysis, Data curation. Doreen Ramogola-Masire: Writing – review & editing. Surbhi Grover: Writing – review & editing, Supervision, Funding acquisition. Erle Robertson: Writing – review & editing, Supervision.
Declaration of competing interest
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: SG reports consulting fees for Lumonus, Harbinger Health, and Sustainable Dialogue for Peaceful Uses (SDPU) for serving as a scientific advisor. All other authors report no conflict of interest.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Baldwin D.A.Feldman M.Alwine J.C.Robertson E.S.Metagenomic assay for identification of microbial pathogens in tumor tissues M Bio 552014 e 01714-1410.1128/m Bio.01714-14PMC 417207525227467 · doi ↗ · pubmed ↗
- 2Banerjee S.Wei Z.Tan F.Distinct microbiological signatures associated with triple negative breast cancer Sci. Rep.52015151622646922510.1038/srep 15162 PMC 4606812 · doi ↗ · pubmed ↗
- 3Banerjee S.Tian T.Wei Z.Microbial signatures associated with oropharyngeal and oral squamous cell carcinomas Sci. Rep.71201740362864260910.1038/s 41598-017-03466-6PMC 5481414 · doi ↗ · pubmed ↗
- 4Banerjee S.Tian T.Wei Z.The ovarian cancer oncobiome Oncotarget 822201736225362452841023410.18632/oncotarget.16717 PMC 5482651 · doi ↗ · pubmed ↗
- 5Banerjee S.Tian T.Wei Z.Distinct microbial signatures associated with different breast cancer types Front. Microbiol.920189512986785710.3389/fmicb.2018.00951 PMC 5962706 · doi ↗ · pubmed ↗
- 6Bosch F.X.Manos M.M.Muñoz N.Prevalence of human papillomavirus in cervical cancer: a worldwide perspective. international biological study on cervical cancer (IBSCC) study Group J. Natl Cancer Inst.87111995796802779122910.1093/jnci/87.11.796 · doi ↗ · pubmed ↗
- 7Botswana U. HIV: Every child has the right to survive and thrive. https://www.unicef.org/botswana/hiv.
- 8Bray F.Ferlay J.Soerjomataram I.Siegel R.L.Torre L.A.Jemal A.Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries CA Cancer J. Clin.68620183944243020759310.3322/caac.21492 · doi ↗ · pubmed ↗
