Determinants of malaria infection across different districts of Khyber Pakhtunkhwa, Pakistan: a cross-sectional study
Ijaz ul Haq, Zafar Mehmood, Amir Muhammad, Sohail Akhtar, Elmuez Alsir Ahmed Aboagarib, Afia Zaib, Sara Awadelkarim Mohammed Ahmed, Mashael Huwaikem, Gausal Azam Khan, Humood Fahm Albugami, Bilal Ahmed, Shenqiang Qu

TL;DR
This study identifies key risk factors for malaria in Pakistan's Khyber Pakhtunkhwa province, including poor housing, low income, and lack of healthcare access.
Contribution
The study uses model selection techniques to identify specific socio-demographic and environmental determinants of malaria in a cross-sectional survey.
Findings
Malaria prevalence was 24.5% among 768 participants in Khyber Pakhtunkhwa.
Key risk factors included low income, poor housing, and distance to healthcare facilities.
Age and family type were strongly associated with malaria infection.
Abstract
Malaria remains a significant public health challenge, particularly in the Khyber Pakhtunkhwa (KP) province, where transmission patterns vary across districts. This study aimed to identify the determinants of malaria transmission using model selection techniques. A cross-sectional study was conducted among general population across various districts in KP, including northern, central, and southern zones. A total of 768 participants across the province were surveyed using a structured questionnaire having socio-demographic variables, housing conditions, access to healthcare, and preventive practices. Malaria infection was diagnosed using the combo rapid diagnostic test. Logistic regression models, including automated model selection techniques, were employed to identify significant predictors of malaria. The overall proportion of malaria among respondents was 24.5%. Bivariate analysis…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —https://doi.org/10.13039/501100004686Deanship of Scientific Research, King Faisal University
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMalaria Research and Control · Digital Imaging for Blood Diseases · Statistical Methods in Epidemiology
Introduction
Infectious diseases remain significant challenges in the field of global health, ones that require careful thought and multifaceted approaches [1]. Since infectious diseases like malaria affect millions of people worldwide and have a considerable economic impact on the areas in which they occur, they pose a severe threat to public health [2]. Due to the global spread of its vectors, malaria is recognized as a mosquito-borne illness that spreads quickly and has a significant impact on morbidity and death,as, it poses a global public health concern [2, 3]. As per the WHO report 2022, there was an endemic prevalence of malaria in Pakistan between January and August 2022, with over 3.4 million suspected cases registered [4], as opposed to 2.6 million cases reported in 2021. Approximately 170,000 instances were confirmed in a lab setting, with 77% of those cases being caused by Plasmodium vivax and 23% by Plasmodium falciparum, which is linked to the deadliest and most severe cases [5]. According to the World Health Organisation, 84% of malaria cases in Pakistan are caused by P. vivax, with P. falciparum accounting for 14.9% of cases and mixed cases (double infections with P. vivax and P. falciparum) for 1.1% of cases [6].
Despite all of the disease control measures that the relevant authorities have developed and implemented, malaria still has a substantial impact on developing countries like Pakistan [7]. Malaria is more common disease in the tropics and subtropical regions [8]. Proper and early detection of malaria help in the reduction of transmission [9]. Human Plasmodium infections can present with a broad spectrum of symptoms, from mild malaria to severe cases. The 2015 WHO criteria for severe malaria include the following symptoms: hypoglycemia, jaundice, pulmonary edema, hemorrhage, numerous convulsions, prostration, acute renal failure, metabolic acidosis or acute respiratory distress, severe anemia, and pulmonary edema [10].
The occurrence, transmission, seasonality, and periodicity of malaria in particular—as well as the geographical and temporal distribution, severity, and duration of infectious diseases in general—are all influenced by socio-environmental factors [11].Understanding malaria dynamics in the context of local socioeconomic and meteorological conditions helps identify the significance of local variables driving malaria variability across time and space [12]. Poorly constructed homes allow easy entry for vectors carrying Plasmodium, increasing the risk of infection for household members. Low-income individuals often struggle to afford medications, related medical expenses, insecticide-treated bed nets (in countries where they are not distributed free of charge), and chemicals for indoor spraying [13].
There are numerous challenges in combating recurrent malaria in Pakistan. It is essential to assess the risk factors associated with malaria across different localities in Khyber Pakhtunkhwa (KP), Pakistan. Therefore, we conducted a study with the primary objective of identifying the determinants contributing to malaria transmission. The findings will help communities adopt effective preventive measures and will provide policymakers with valuable insights to develop informed strategies for malaria prevention in KP.
Methods
Study design and settings
The cross-sectional study was conducted from Jan 2022- Dec 2022 in selected districts of KP. This community-based study involved respondents from various districts of KP, including Dera Ismail Khan, Bannu, Karak, and Tank (south zone); Haripur, Abbottabad, Dir, Kohistan, Swat, and Chitral (north zone); and Peshawar, Charsadda, Nowshera, and Mardan (central zone). Inclusion criteria included participants who were willing to participate, available at home at the time of data collection, from any age group, and with no restrictions based on fever history.
Sample size and sampling strategy
A representative sample of respondents was selected from various districts of KP. We used a two-stage sampling design. In the first stage, Primary Sampling Units (PSUs) were selected using probability proportional to size (PPS) sampling, purposively considering the severity of the disease and the prevailing law and order situation. In the second stage, Secondary Sampling Units (SSUs) were identified for the selection of index respondents. SSUs were selected using random-cum-systematic random sampling. The sample size was calculated using a formula.
( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$n={z}^{2}\left\{pq\right\}/{e}^{2}$$\end{document} ), yielding an initial sample size of 384. However, considering the design effect (n x d), the final sample size was 768[14]. Design effect quantifies how much the actual sampling design increases or decreases the variance of your estimate compared to simple random sampling. A design effect of 2 was applied to the sample size obtained from the sample size formula to account for cluster sampling, which is a commonly used threshold value at a 95% confidence interval [14]. The sample was allocated Supplementary Table 1. The allocation of samples for socio-economic and demographic data was based on district population and reported malaria cases. Districts with higher populations and disease incidence were assigned larger samples, while those with lower malaria prevalence received smaller sample allocations.
Data collection procedure
A pretested, structured questionnaire adapted from the WHO core standard questionnaire was used to collect data on socio-economic and demographic factors related to malaria [14]. Trained enumerators collected data by Kobo App (https://www.kobotoolbox.org/).
Definition of malaria and study variables
Malaria was defined as a measured fever of ≥ 37.5 °C within the past 48 h accompanied by a positive blood smear for Plasmodium falciparum [15]. To confirm a malaria case, combo RDTs and microscopy were used interchangeably, depending on their availability at the laboratory. Explanatory variables included socio-economic and demographic characteristics such as age, sex, marital status, family size, education, occupation, family income, household type, distance to the nearest health center, water source, bed net availability and use, latrine availability, kind of utensils used for storing water, drinking water quality, and availability of any kind of livestock in home [14].
Statistical analysis
Data were checked, verified, and analyzed using statistical software (R). Descriptive statistics, odds ratios, and logistic regression models (binary and multiple) were used to analyze the data.
A household-level random effect was included because more than one participant could be sampled from the same household. To account for variability across districts, a district-level random effect was also incorporated into the multivariable logistic regression model. The logistic regression model was used to examine the relationships between the binary dependent variable and socio-economic variables. Automated model selection method using R with \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\alpha }_{in}=0.07 and {\alpha }_{out}=0.18$$\end{document} were used for the final determinants of malaria**.** A p-value < 0.05 was considered significant.
Results
Demographic and socio-economic characteristics of the study population
Table 1 displays the percentage distribution of socio-economic variables. The total number of individuals included in the study were 768. The majority (69.4%) were males, 34.5% were between 30 to 50 years of age, 39.1% lived in semi-pucca houses (partially, 54.6% lived in nuclear families, and 46.2% had a small family size. Regarding education, 44% were below secondary school certificate (SSC). As for other characteristics, 55% were self-employed, the majority (37%) of respondents lived at a distance of 1–3 km from the nearest health center, 47.1% of households had an income above 50,000 rupees, 44.3% kept livestock, 66.9% had a safe flush toilet, and only 36.3% of respondents used bed nets while sleeping outdoors. Table 1. Percentage distribution of socioeconomic and demographic variables of the study populationCharacteristicsN%GenderMale53369.4Female23530.6Age (in years) < = 1514719.116–3016621.631–5026534.5 > 5019024.7Family Size (individuals)Small (1–5)35546.2Medium (5–8)22429.2Large (> 8)18924.6Family TypeJoint34945.4Nuclear41954.6Marital StatusUnmarried37548.8Married39351.2Education of the Respondents < SSC33844.0 = > SSC43056.0OccupationPeasant17322.5Employed13817.9Self-employed42355.0Un-Employed354.5Income (RS) < 20,00023330.320,000–5000017322.5 > 50,00036247.1Availability of any Kind Of Livestock at householdYes34044.3No42855.7House Typepucca17823.2Semi pucca30039.1Kacha29037.8Modern Toilet facility availableYes51466.9No25433.1Kind Of Utensils Use For Storing WaterWithout Lid28937.7With Lid47962.3Drinking Water QualityNot Improved36547.5Improved40352.5Bed Net UseNo/Partial48963.7Yes27936.3
Supplementary table shows the proportion of participants malaria positive in each district. The results revealed that proportion of malaria was 24.5% (n = 188) out of total 768 respondents.
Bi-variate logistic regression analysis for malaria and its associated risk actors
Bivariate logistic regression analysis with odds ratios is given in Table 2. The results revealed that males were 1.73 times more likely to suffer from malaria compared to females (OR = 1.73 (1.18–2.55), P < 0.001), and individuals aged 16 to 30 years were 5.20 times more likely to suffer from malaria compared to the age group greater than 50 years (OR = 5.20, P < 0.001). The odds of malaria were about 1.43 times higher in individuals living in a joint family system (OR = 1.43 (1.03–1.99), P < 0.05), while the odds of malaria were about 4.59 times higher in large family sizes compared to small family sizes (OR = 4.59 (3.02–6.99), P < 0.001). Similarly, those with no or below matric education were about 3 times more likely to suffer from malaria compared to individuals with matric or above education (OR = 3.12 (2.21–4.39), P < 0.001). Individuals living in houses constructed with semi/partial concrete materials were about 1.76 times (OR = 1.76 (1.15–2.69), P < 0.01) more likely to suffer from malaria, and those living in houses constructed with mud/kacha materials were about 1.2 times (OR = 1.21 (0.82–1.76), P > 0.05) more likely to suffer from malaria compared to individuals living in houses with full concrete/pucca construction materials. People with low income (i.e., income < Rs. 10,000) were about 3 times (OR = 3.02 (2.01–4.53), P < 0.001) more likely, and those with middle income (i.e., income Rs. 10,000–50,000) were about 3.5 times (OR = 3.49 (2.27–5.36), P < 0.001) more likely to suffer from malaria compared to individuals with income above Rs. 50,000. People living at a distance of 1–3 km and more than 3 km from the Nearest Health Center (NHC) were about 1.7 and 2.2 times, respectively, and significantly more likely to suffer from malaria compared to those living at a distance of below 1 km from NHC. Those who were not using bed nets were about 2.59 times (OR = 2.59 (1.76–3.81), P < 0.001) more likely to suffer from malaria compared to those using bed nets while sleeping. Table 2. Results of binary logistic regression for MalariaCharacteristicsMalariaOR(CI)P-ValueNoYesGenderMale3871461.73 (1.18–2.55)0.005Female19342–Age (Years)0–15107403.18 (1.76–5.73) < 0.0116–30103635.20 (2.97–9.10)31–50200652.76 (1.61–4.75) > 5017020–House Typepucca12256–0.0090.335Semi pucca228721.76(1.15–2.69)Kacha230601.21(0.82–1.76)^NS^Family TypeJoint251981.43(1.03–1.99)0.035Individual32990–Family SizeSmall30748– < 0.01Medium163612.39(1.57–3.65)Large110794.59(3.02–6.99)Education < SSC2161223.12(2.21–4.39) < 0.01 > = SSC36466–OccupationPeasant80270.97(0.57–1.65) > 0.05Employed135491.04(0.67–1.63)Self employed201550.79(0.52–1.20)Un-employed16457–Income (Rs) < 10,000157763.02(2.01–4.53) < 0.0110,000–50000111623.49(2.27–5.36) > 50,00031250–Availability of any kind of LivestockYes246941.36(0.98–1.89)0.05No33494–Marital StatusMarried298951.03(0.74–1.44)0.83Unmarried28293–Drinking Water QualityImproved31093–0.34Not Improved270951.17 (0.84–1.63)Distance to the Nearest Health Center < 1 km18437–0.0260.0011–3 km213711.66 (1.06–2.58) > 3 km183802.17(1.40–3.38)Kind of Utensils Used for Storing WaterWithout lid210791.96 (1.374–2.78) < 0.001With lid370109–Bed Net Use?Yes23940– < 0.01No3411482.59 (1.76–3.81)Modern Toilet facility availableYes388126– > 0.05No192620.99 (0.70–1.41)^^Shows results significant at 5% level of significance^^Shows results significant at 1% level of significanceNS stands for Non-significant
Stratified analysis of bed net uses by level of education on malaria status
The results of the stratified analysis of bed net utilization by education on the effect of malaria are presented in Supplementary Table 3. It is evident that individuals with no or less education are at a higher risk of being infected with malaria compared to individuals with SSC or higher education (OR = 1.67 for no or less than SSC education vs. OR = 1.10 for educated individuals). The analysis also showed a higher prevalence of malaria (39.2%) among suspected patients with no or less education who did not use bed nets, compared to educated suspected patients who also did not use bed nets during sleep (only 14.6%).
Summary statistics for the final model of collected data
Automated model selection method with \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\alpha }_{in}=0.07\text{and }{\alpha }_{out}=0.18$$\end{document} were used for the final determinants. Using the optimal cutoff values for entry and removal of predictor variables in the model, the final multiple logistic regression model is summarized in Table 3. The results of multivariate model identified gender (AOR 1.8, CI 1.1–2.8, P < 0.05), age group 16–30 years (AOR 4.2, CI 2.1–8.5, P < 0.01), age group 31–50 years (AOR 8.3, CI 4.2–16.3, P < 0.01), age group > 50 years (AOR 3.6, CI 1.9–6.7, P < 0.01), family type (AOR 2.6, CI 1.47–4.61, P = 0.001), education level (AOR 3.9, CI 2.6–5.9, P < 0.01), income level 10,000–50000 rupees (AOR 17.8, CI 8.1–38.9, P < 0.05), large family size (AOR 11.3, CI 5.9–21.7, P < 0.05), distance to healthcare facilities more than 3 km (AOR 3.1, CI 1.8–5.3, P < 0.01), and lack of modern toilets (AOR 4.8, CI 2.4–9.4, P < 0.01) as independent risk factors of malaria prevalence. Table 3. Final model for the determinants of Malaria transmissionBS.EWalddfExp(B)95% CI**PGender0.586.2356.20511.7971.13–2.850.013Age (years)0–15**16–301.4390.35616.30114.2182.10–8.47 < 0.0131–502.1150.34537.64218.2904.21–16.32 < 0.01 > 501.2680.32715.04113.5551.87–6.74 < 0.01Family Type0.9570.29210.73312.6041.47–4.610.001Education1.3620.21440.36613.9052.57–5.94 < 0.01Income (rupees) < 10,00010,000–500002.8810.39952.034117.8248.16–38.94 < 0.01 > 50,0001.7830.27641.60615.9463.46–10.22 < 0.01Family SizeSmallMedium1.8200.34627.61116.1733.13–12.15 < 0.01Large2.4290.33054.085111.3445.95–21.70 < 0.01Distance to the Nearest Health Center < 1 km1–3 km0.6550.2755.67111.9251.12–3.300.017 > 3 km1.1320.27616.81513.1031.81–5.33 < 0.01Modern Toilet facility available1.5600.34720.27214.7602.41–9.40 < 0.01Constant− 9.5390.906110.81310.000 < 0.01
Discussion
The study demonstrated a significant association between malaria and various socio-demographic factors, including age and gender of the head of household, family size, family type, educational status, toilet availability, distance to nearest healthcare facilities, and income level. Furthermore, the proportion was 24.5% of malaria Khyber Pakhtunkhwa, highlighting the substantial burden on the health system.
The two common parasites in the study region, Plasmodium falciparum and Plasmodium vivax, impact individuals of all ages and genders [16]. In the current study, gender was a significant risk factor that influenced the frequency of malaria in a community. Women are comparatively more involved in interior domestic chores, while male heads of families are more likely to participate in outside activities, increasing their exposure to mosquitoes [17]. Many studies have explored the similar risk of getting malaria for both genders [18]. Our findings on the significant association between age and malaria status are consistent with Kigozi et al. (2020) [19]. While prior studies suggest that children under five are more likely to contract malaria [20]. other research indicates that early parental reporting, social lifestyle, knowledge, prevention, and maternal antibodies may contribute to lower malaria frequencies in children aged 1–3 [21].
The size of the family was one of the other extremely important socio-demographic factors that the current study examined. Compared to smaller families, households with more members or joint family had significantly greater odds of experiencing a malaria case, and these findings are consistent with the study reported by Thomas et. al [18]. This may be the case due to the higher likelihood of younger children in big households, who belong to a high-risk demographic. Because the olfactory signals for mosquitoes get stronger at crowding and attracting more insects, the number of occupants in a residence also increases the population of mosquitoes [22].
Another risk factor for the development of malaria was the distance between the home and the closest medical institution. According to the study's findings, family members who live close to health facilities tend to have more frequent and timely access to medical treatment than those who live far from such facilities. However, these findings are inconsistent with those of Toh et al. [23]. The distance to a health facility can be a barrier to regular health evaluations due to transportation costs and lost work time. The accessibility of a health institution is influenced by factors such as distance and transportation expenses.
The current research found a statistically significant relationship between participants' education level and malaria incidence. Respondents with lower levels of education and illiteracy were at a higher risk of contracting malaria, which is inconsistent with the findings of Siegert et al. [24]. This result was expected, as education improves knowledge of infectious diseases and necessary preventive measures. Education is thought to function as a protective social factor through various indirect pathways, including occupation, socioeconomic position, and symptom knowledge, which increases the likelihood of treatment-seeking behavior. However, a previous study in India found no association between education and malaria incidence [25].
Many studies have demonstrated a strong correlation between low income and malaria, which is commonly believed to be a disease of the poor. These findings are consistent with the current study [26]. However, other research has not shown a strong correlation between household income and malaria [17]. Additionally, malaria cases often increase after natural catastrophes, particularly in low-income nations like Pakistan [27]. The primary causes of this increase include inadequate vector control, inefficient surveillance systems, and poverty.
The study also investigated the relationship between malaria and toilets facility in households. Earlier research found that inadequate sanitation facilities represent a significant risk factor [28]. Another study found lack of toilet facility as a risk factor of malaria [29]. Another study reported that households with flush toilet had a decreased risk of malaria prevalence whereas no toilet facility increases risk of malaria [30].
The implementation of malaria control methods, such as bed nets greatly decreased the likelihood of contracting malaria. Use of bed nets prevents mosquito biting and thus prevents malaria. These findings are consistent with an earlier study [31]. However, some studies have found no correlation between the use of insecticide-treated nets, and long-lasting insecticide-treated nets [32]. In our study, bivariate analysis revealed a significant relationship between bed net use and malaria transmission, but this relationship was not significant in the final model.
There were certain limitations of the study. First, because of its cross-sectional study design, It may not show cause and effects. Second, as malaria in recurring in the region, mapping of the incidence is lacking. Therefore, in the future, we have planned to conduct study about the mapping of malaria for the targeted interventions. The altitude and latitude is also missing, which may show better interventions for the high and low altitude areas. Nevertheless, the current study has comprehensively identified the different risk factors of malaria by applying different statistical models.
Conclusion
This study highlights the multifactorial nature of malaria transmission in KP, Pakistan, with strong associations found between malaria infection and various determinants. Key risk factors identified include younger age, male gender, low educational attainment, low household income, large family size, distance from health facilities, poor housing construction, and lack of modern sanitation. Although preventive measures such as bed net use showed significance in bivariate analysis, their independent effect was not retained in the final model, suggesting the need for comprehensive interventions beyond individual behavior change. Policymakers and public health authorities should use these findings to inform district-level malaria control strategies, focusing on education, improved housing, access to healthcare, and sanitation. Government along with International NGOs should work in collaboration for preventive strategies to control malaria in the region. Future research should incorporate geographic mapping, altitude data, and seasonal patterns to enhance the targeting and effectiveness of malaria prevention programs in the region.
Supplementary Information
Supplementary file 1
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Organization WH. WHO. World malaria report 2018. https://apps.who.int/iris/bitstream/handle/10665/275867/9789241565653-eng.pdf?ua = 1. 2019.
- 2Bationo CS, Gaudart J, Dieng S, Cissoko M, Taconet P, Ouedraogo B, et al. Spatio-temporal analysis and prediction of malaria cases using remote sensing meteorological data in Diébougou health district, Burkina Faso, 2016–2017. Sci Rep. 2021;11(1):20027.10.1038/s 41598-021-99457-9PMC 850102634625589 · doi ↗ · pubmed ↗
- 3Haq IU, Mehmood Z, Albugami HF, Alaithan RH, Akhtar SJT Ajotm, hygiene. Statistical Modeling for the Identification of Determinants of Dengue Transmission in Different Districts of Khyber Pakhtunkhwa Province, Pakistan: A Cross-Sectional Study. Am J Trop Med Hyg 2025:10;113(2):353–360.10.4269/ajtmh.25-0077 PMC 1236011840494334 · doi ↗ · pubmed ↗
- 4Thomas S, Ravishankaran S, Asokan A, Johnson Amala Justin N, Maria Jusler Kalsingh T, Mathai MT, et al. Socio-demographic and household attributes may not necessarily influence malaria: evidence from a cross sectional study of households in an urban slum setting of Chennai, India. Malar J. 2018;17:1–11.10.1186/s 12936-017-2150-z PMC 575500429304794 · doi ↗ · pubmed ↗
- 5Asif AM, Tahir MR, Arshad IAJ Jo TP. Socioeconomic condition and prevalence of malaria fever in Pakistani children: findings from a Community Health Survey. J Trop Pediatr. 2018;64(3):189–94.10.1093/tropej/fmx 04728985421 · doi ↗ · pubmed ↗
- 6Mwingira V, Mboera LE, Dicke M, Takken WJ Jo VE. Exploiting the chemical ecology of mosquito oviposition behavior in mosquito surveillance and control: a review. J Vector Ecol 2020;45(2):155–79.10.1111/jvec.1238733207066 · doi ↗ · pubmed ↗
- 7Yadav K, Dhiman S, Rabha B, Saikia P, Veer VJ Idop. Socio-economic determinants for malaria transmission risk in an endemic primary health centre in Assam, India. Infect Dis Poverty. 2014;3:1–8.10.1186/2049-9957-3-19PMC 407838924991410 · doi ↗ · pubmed ↗
