Socioeconomic and Eco-Environmental Drivers Differentially Trigger and Amplify Bacterial and Viral Outbreaks of Zoonotic Pathogens
Payton Phillips, Negin Nazari, Sneha Dharwadkar, Antoine Filion, Benedicta Essuon Akaribo, Patrick Stephens, Mekala Sundaram

TL;DR
This study shows that socioeconomic and environmental factors differently trigger and amplify bacterial and viral zoonotic disease outbreaks.
Contribution
The study reveals distinct driver profiles for bacterial and viral zoonotic outbreaks, including socioeconomic amplification of viral outbreaks.
Findings
Socioeconomic factors trigger bacterial outbreaks more often than viral ones.
Ecological and environmental factors are more associated with viral outbreak triggers.
Socioeconomic factors amplify viral outbreaks, leading to higher case numbers.
Abstract
The frequency of infectious disease outbreaks and pandemics is rising, demanding an understanding of their drivers. Common wisdom suggests that increases in outbreak frequency are driven by socioeconomic factors such as globalization and urbanization, yet, the majority of disease outbreaks are caused by zoonotic pathogens that can be transmitted from animals to humans, suggesting the important role of ecological and environmental drivers. Previous studies of outbreak drivers have also failed to quantify the differences between major classes of pathogens, such as bacterial and viral pathogens. Here, we reconsider the observed drivers of a global sample of 300 zoonotic outbreaks, including the 100 largest outbreaks that occurred between 1977 and 2017. We show that socioeconomic factors more often trigger outbreaks of bacterial pathogens, whereas ecological and environmental factors…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2- —National Institutes of Health
- —National Science Foundation
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsZoonotic diseases and public health · Animal Disease Management and Epidemiology · Viral Infections and Vectors
1. Introduction
Understanding the spatial and temporal distribution of novel infectious disease events is among the most important tasks for the coming century [1,2,3]. According to the World Health Organization [4], nearly 10 million people die from infectious diseases each year. This includes a range of diseases such as respiratory infections, diarrheal diseases, tuberculosis, HIV/AIDS, malaria, and others. The overall frequency of disease outbreaks also appears to be increasing [2], and more than half of outbreaks and emerging infectious diseases (EID) events are zoonotic [1,2]. This indicates a critical need for a better understanding of the factors that cause and propagate disease outbreaks, particularly those caused by zoonotic pathogens that can be transmitted from animals to humans.
It has been speculated that increases in outbreak frequency are related to socioeconomic factors such as urbanization [5,6], agricultural activity [7], and globalization reviewed in [2,8,9]. However, the fact that the majority of disease events are caused by zoonotic pathogens [1,2] implies that environmental and ecological factors, including climate change [10], changes in the distribution and abundance of vectors [11], and biodiversity loss [12], might play an unexpectedly large role. It is currently difficult to assess whether socioeconomic factors or eco-environmental factors predominate, because there have been so few quantitative global studies of disease dynamics. To date, most quantitative studies have been focused on individual pathogens, e.g., [13,14,15,16,17,18,19], or in some cases, individual disease events, e.g., [20,21,22]. There have been relatively few studies characterizing the drivers of outbreaks in general, that is, of many different types of diseases considered in aggregate, but see [1,2,23,24]. Furthermore, no global studies that we are aware of have tested for differences in the drivers of outbreaks of major natural classes of pathogens. There is a great need for a better understanding of how the drivers of disease dynamics can both be shared and differ among various disease systems.
For example, Stephens et al. [24] quantified the drivers of a global sample of 300 zoonotic outbreaks. They showed that the driver profiles of the largest outbreaks, with thousands or more cases, differed significantly from those of randomly selected outbreaks, and also showed that large outbreaks were more often caused by viral than bacterial pathogens. However, even this study did not explicitly consider whether and how the drivers observed generally differed between viral and bacterial pathogens considered in aggregate. From a disease transmission and management perspective, these groups exhibit many important differences. For one, there are no broad-spectrum antivirals analogous to antibiotics in wide use, and so the evolution of antimicrobial resistance is not an important factor in viral outbreaks [25]. Viruses also tend to evolve more quickly than bacteria due to shorter generation times, with some viral lineages showing the highest rates of molecular evolution known across the tree of life [26,27]. Perhaps because of this, viruses are also more often able to infect multiple and/or distantly related host species than bacterial pathogens [28]. Somewhat surprisingly, whether these and other differences in the characteristics of viruses and bacteria translate into differences in the drivers commonly associated with outbreaks has never been systematically tested. For example, whether the socioeconomic factors that are thought to be driving increases in overall outbreak frequency [2,9] are more important compared to ecological and environmental factors, in generating or amplifying viral or bacterial outbreaks, has never been directly quantified.
Here, we reconsider the reported drivers of a global sample of 300 zoonotic disease outbreaks [24]. We focus on two main questions: (1) are there significant differences in the relative importance of socioeconomic versus ecological and environmental factors between viral and bacterial outbreaks, and (2) does the proportion of socioeconomic factors that contribute to an outbreak influence variation in realized case numbers, and if so, is this effect strongest in outbreaks of viral or bacterial pathogens? Though we consider these analyses primarily proof-of-concept, we regard them as a first step towards illuminating variation in the specific triggers and amplifiers shaping the dynamics of bacterial and viral outbreaks across the globe.
2. Materials and Methods
This study is based on the outbreak data of Stephens et al. [24], which consist of a sample of outbreaks from a database of approximately 4000 zoonotic outbreaks that occurred between 1974 and 2017. It includes the 100 largest outbreaks (in terms of realized case numbers) and 200 additional outbreaks sampled at random from the entire database. Here, the particular pathogens we define as “zoonotic” follow the recommendations of several zoonotic specialist groups including the CDC [29], UK Health Ministry [30], and Pan American Health Organization [31]. However, in general, we include pathogens in which wild animal reservoirs are at least a somewhat frequent source of infection. We reaggregated the data of Stephens et al. [24] to the country level, assigning outbreaks that spread to more than one country to multiple rows (one for each country). In the end, we had 106 “large” outbreaks (42 bacterial and 64 viral) and 210 randomly sampled “background” outbreaks (175 bacterial and 35 viral).
Stephens et al. [24] scored the reported drivers of outbreaks using a schema of 48 potential drivers, which was further described in Stephens et al. [32]. It consists of a binary rubric of 48 potential drivers (Table 1) based on factors discussed in reviews and syntheses of the literature on zoonotic outbreaks, e.g., [33,34,35,36]. The schema is designed to represent a variety of different kinds of drivers including ecological [37,38], environmental [7,39], and socioeconomic [40,41,42] factors. The drivers of outbreaks are scored based on factors reported both in peer-reviewed literature, as well as high quality gray literature written by specialists, including ProMED e-mails [43], Morbidity and Mortality Weekly Reports [29], and WHO reports, e.g., [44,45]. For each individual outbreak, a given driver was scored as either (0), not reported as contributing to an outbreak or (1), reported as contributing to an outbreak by at least one source, with a source noted.
We divided the 48 drivers of Stephens et al. [24] into socioeconomic (SE) and/or ecological and environmental (EE) drivers (see Table 1). Socioeconomic drivers were those dealing primarily with social and economic aspects of human society, such as poverty, medical systems and interventions (for example antibiotics), cultural practices, trade, and travel. Ecological and environmental drivers were those pertaining to natural systems, such as weather, climate change, and changes to vector and reservoir populations, or interventions in natural systems such as the introduction and spread of invasive species. Some drivers were also considered both. For example, encroachment on wild areas occurs at the interface of settled and wild areas and is generally driven by SE factors. However, the disease transmission risk that it represents also largely depends on EE factors, such as the identity of wild animal hosts or vector abundance. The drivers we considered were in turn derived from synthetic reviews of the literature on zoonotic outbreaks (e.g., [33,34,36,46], see Stephens et al. (2021) for a more detailed description). For each outbreak, we calculated the proportion of drivers recorded as SE vs. EE (Figure 1).
2.1. Statistical Analyses
Using the data described above, we conducted two sets of analyses. Data and code for all analyses presented are included in the Supplementary Materials. All analyses were conducted in R v. 4.3.2 (Vienna, Austria) [47].
2.1.1. Triggers of Bacterial vs. Viral Outbreaks
We tested whether bacterial vs. viral outbreaks tended to have a greater proportion of socioeconomic drivers or eco-environmental drivers. First, we used logistic regression to compare the type of pathogen that caused an outbreak, (0) viral or (1) bacterial, to the proportion of reported SE drivers (i.e., the proportion of drivers that were either SE alone or both). Then, we compared the type of outbreak, (1) viral or (0) bacterial, to the proportion of reported EE drivers (i.e., the proportion of outbreaks that were either EE alone or both). We included outbreak year as a covariate in both sets of models to account for possible differences in sampling effort, on the assumption that more recent outbreaks may be more thoroughly characterized than older outbreaks (see discussion of sample bias in Section 2.2 below). Finally, we conducted separate analyses of large and background outbreaks, for a total of four models. To ensure that the classification of drivers did not unduly influence our results, we conducted sensitivity analyses for our models with different definitions of EE and SE. First, we included the drivers classified as both SE and EE when calculating the proportion of either driver type. Second, we calculated the proportion of each driver type while excluding those categorized as both, essentially assuming that they were categorized as the opposing type of driver. We found that the classification of drivers did not influence our results and therefore report only the one set of analyses with “both” drivers in the main text (see Supplementary Materials Table S1 for analyses excluding “both” drivers).
2.1.2. Amplifiers of Bacterial and Viral Outbreaks
Next, we considered whether the proportion of SE vs. EE drivers was related to variation in realized case numbers among outbreaks. Using negative binomial regression implemented in the R package MASS v. 7.3-61 [48], we compared the number of reported cases in each outbreak to the proportion of SE drivers, again including outbreak year as a covariate. We performed this analysis for (1) large bacterial outbreaks, (2) background bacterial outbreaks, (3) large viral outbreaks, and (4) background viral outbreaks. We parameterized each model twice, once with drivers categorized as both EE and SE drivers and once without. We found no qualitative difference in the models and report only the results with all drivers included here (see Supplementary Materials Table S2).
2.2. Sample Bias
Analyses of large spatiotemporal databases of diseases can introduce problems if reporting effort is not accounted for [2,24]. However, variation in sampling effort is unlikely to have affected our analyses. Our logistic regression analyses of viral vs. bacterial outbreaks would only be biased if there were some consistent difference in sampling effort between these two classes of pathogens. We are unaware of any evidence that suggests that bacterial or viral outbreaks tend to be consistently better studied. Stephens et al. [24] did show some evidence that large outbreaks were better studied, often being documented in more sources and having more reported drivers than background outbreaks. This is the reason that we conducted separate analyses of large and background outbreaks. Another possibility that has not, to our knowledge, been discussed previously, is that more recent outbreaks may tend to be better documented than older outbreaks, due to increases in the quality and quantity of communication infrastructure (e.g., phone and internet availability), medical technology, and other similar factors over time. In order to account for this possible bias, we also included outbreak start year as a covariate in all of our models. In addition, we explored additional potential sources of bias by including country-level gross domestic product (GDP) and population density in our models [49]. Past research has suggested that indicators of economic activity, such as GDP, may relate to improved health infrastructure and surveillance capabilities, leading to higher chances of outbreak detection [2,50]. More densely populated areas may also lead to higher outbreak detections [24]. The results of analyses that included the covariates were qualitatively identical with respect to the hypotheses tested. Furthermore, one of the models including these factors was overfitted and suffered from a lack of convergence. Therefore, we report only the results of models including year in the main text (for models including GDP and population density covariates, see Supplementary Materials Tables S3 and S4). Our results proved quite robust and generally consistent across all subsets of the data we considered, and similar regardless of whether bias covariates were significantly correlated with a given response variable (see results). This suggests that reporting efforts likely did not influence the recorded drivers in outbreaks and did not differentially influence viral vs. bacterial outbreak dynamics.
3. Results
We examined large and background outbreaks independently with logistic regressions. Across both datasets, our analyses found that bacterial outbreaks were positively related to the proportion of SE drivers recorded and not to outbreak start year (Table 2). The odds of a large outbreak being bacterial increased by e^6.35^ = 572 times when the proportion of SE drivers recorded increased from 0 to 1. Even for background outbreaks of smaller sizes, the odds of a bacterial outbreak increased by 17 times (e^2.83^) with a change from 0 to 100% SE drivers (Table 2). We observed similar patterns for the effect of EE drivers on viral outbreaks (Table 2). If the proportion of EE drivers changed from 0 to 1, the odds that an outbreak had viral origins increased by e^5.06^ = 158 times for large outbreaks and by e^2.45^ = 12 times for background outbreaks (Table 2).
The size of bacterial pandemics and epidemics was not driven uniquely by SE or EE drivers. Our negative binomial regressions found no significant predictor of case numbers in either large or background bacterial outbreak datasets (Figure 2a; Table 3). Start year of the outbreak was also not related to case numbers (Table 3). Conversely, we found a positive slope for proportion of SE drivers and viral outbreak case numbers both in the large and background outbreak datasets (Figure 2b; Table 3). Case numbers increased by e^1.79^ = 6 times in large viral outbreaks, with an increase in the proportion of SE drivers from 0 to 1 (Table 3). For background viral outbreaks, case numbers increased 31 times (e^3.44^) when all SE drivers were reported, as opposed to no SE drivers reported (Table 3). Start year showed a small negative coefficient in predicting case numbers for viral outbreaks in both large and background outbreak datasets (Figure 2c; Table 3).
4. Discussion
Global increases in outbreak frequency [2] and disease emergence events [1] indicate a great need for more comprehensive management strategies that can potentially impact the dynamics of many different diseases. This is because it will be difficult to predict exactly where and when a particular disease may manifest next. However, infectious disease outbreaks are also caused by a wide variety of pathogens with varying characteristics. Furthermore, there is, as yet, no consensus as to whether socioeconomic (SE) factors such as globalization and urbanization or eco-environmental (EE) factors such as climate change, weather patterns, and the population dynamics of vectors are most important in driving global trends in outbreak frequency. As a proof-of-concept, we divided outbreaks along an obvious biological axis, considering whether the influence of SE vs. EE drivers differs between viral and bacterial outbreaks. The results of our analysis reveal important similarities and differences among outbreaks caused by different pathogens.
We show that SE factors were much more important in driving bacterial than viral outbreaks overall, with the chances that an outbreak was bacterial increasing drastically as the proportion of SE drivers that contributed to an outbreak increased (Table 2; Figure 1). These results corroborate previous works implicating socioeconomic triggers such as poverty, poor sanitation, and drug/antibiotic usage in the start of bacterial outbreaks [24,51,52]. For instance, some bacterial outbreaks result from the consumption of unwashed foods or contamination with fecal matter [24], while others begin from the evolution of drug resistance [53]. The latter factor has especially been cited as a growing threat to global public health [54].
In contrast to bacterial outbreaks, viral disease emergence is often driven by eco-environmental factors (Table 2; Figure 1), including climate change, changes in reservoir (or vector) abundances, and human–animal interface dynamics. These results are supported by separate statistical analyses for both large and background viral outbreaks (Table 2), as well as by past studies examining the origins of viral outbreaks [1,24,55]. For example, bat–human interfaces are known to initiate filoviral and henipaviral infections [32,56,57,58,59]. In general, resource use by bats is a key ecological driver of many bat-borne diseases [56,58,60]. Climatic conditions are additionally frequently cited as drivers of viral vector-borne diseases such as Crimean Congo Hemorrhagic Fever and Rift Valley fever [61,62].
Somewhat conversely, the proportion of SE factors rather than EE factors was important in amplifying viral outbreaks, with the expected case numbers increasing as the proportion of SE factors reported to drive an outbreak increased (Table 3, Figure 2b). This result supports the hypothesis that travel and trade, medical infrastructure, war, and other socioeconomic parameters increase case numbers and exacerbate the spread of viral pathogens [24,32,63,64,65]. The recent COVID-19 pandemic, which occurred after the most recent outbreaks included in the data we used for this study, is a prime example of a very large viral outbreak with case numbers driven by a myriad of socioeconomic drivers such as unequal access to healthcare, international/national travel, and vaccine hesitancy [65,66,67]. The rapid spread and rise in case numbers as a result of SE drivers mirrors the 6-fold increase in case numbers as SE drivers increased in our results.
In contrast to viral outbreaks, the proportion of EE vs. SE factors had little influence on case numbers reported in outbreaks of bacterial pathogens. Although the trends were in the same direction as those of viral outbreaks, they were not statistically significant in large or background bacterial outbreaks (Table 3, Figure 2a). This suggests that no single class of drivers is responsible for large epidemics of bacterial pathogens. Instead, bacterial outbreak size may be related to a synthesis of multiple socioeconomic and eco-environmental drivers or to a few specific drivers, rather than an aggregate of driver types. Stephens et al. [24] did examine what drivers are responsible for large outbreaks. Their conclusion was that specific critical failures, such as the contamination of a sewage system [68] or large-scale food contamination events [69], which would both be categorized as socioeconomic drivers in our analysis, typically resulted in a greater spread of bacteria.
Just as important as the differences that we found between viral and bacterial outbreaks was the fact that within each class of pathogen, the results were surprisingly consistent across large and background outbreaks. Though the strength of the relationships observed varied somewhat, the qualitative results were highly consistent. The proportion of EE drivers influenced the chances that both large and background outbreaks were caused by a viral pathogen. Perhaps more importantly, the fact that viral outbreaks with more SE drivers tended to be larger, with more reported cases, was also consistent between large and background viral outbreaks. This is in sharp contrast to the results of Stephens et al. [24], which showed significant differences in the driver profile of large vs. background outbreaks. The consistency of our results likely indicates that even taking simple biological differences between pathogens, viral vs. bacterial in the case of our analyses, can help make commonalities among outbreaks of different diseases more apparent. We also speculate that aggregating all drivers into one of two broad categories (i.e., SE or EE), instead of using the more granular approach of Stephens et al. [24,32], may have also helped reveal commonalities between large and background outbreaks.
For the first time, we also considered the potential of the year in which an outbreak occurred to bias our results. We did this by including outbreak start year as a covariate in all of our analyses. Smith et al. [2] showed that factors such as number of phone lines and number of internet users, which in most countries have tended to increase over time [49], can bias disease data, with more outbreaks generally documented in countries with better communication infrastructure. Improvements in medical technology and advances in epidemiology as a discipline over time could also change how disease outbreaks are understood and investigated. However, we found no evidence that these factors biased our analyses. Generally, there was no relationship between outbreak start year and any response variable we considered. We did find a small and negative effect of start year on case numbers for viral outbreaks (Table 3). However, we suggest that this effect is reflective of the raw data that was published prior to the COVID-19 pandemic (see original publication and data in Stephens et al. [24]). We speculate that if more recent outbreaks such as Monkeypox (N = 102,000 cases [70]) and COVID-19 (N = 777,315,739 cases [71]) were included, this parameter would not be significant.
Another potential source of bias that we did not directly consider is that our scored drivers reflect only the most discussed drivers, which may not always include the true predictors of infectious diseases. Certainly, the drivers examined in this study were gathered from published narratives describing outbreaks (see Stephens et al. [24] for details), which might be subject to biases. However, scored drivers do statistically differentiate large outbreaks, as shown in Stephens et al. [24], and filovirus outbreaks, as explored in Stephens et al. [32]. These unique reported drivers associated with many different types of outbreaks could provide important indicators of infectious disease spread that warrant more attention. Quantifying the critical socioeconomic drivers triggering bacterial outbreaks and/or leading to the spread of viral cases is an important next step. Similarly, quantifying the EE triggers of viral disease outbreaks is necessary in order to forecast when and where new outbreaks will arise. In doing so, we may be able to prevent outbreaks or address them early on, before case numbers rise due to other factors.
5. Conclusions
Overall, we suggest that local conditions can enhance the chances of specific types of outbreaks [72]. Although this finding is not new, very few have investigated broad drivers and their association with broad pathogen classes. We propose that this angle of investigation opens up new areas of insights. For example, if broader groups of pathogens, such as bacteria and viruses, demonstrate unique mechanistic triggers and drivers of case numbers, then other broad classes of pathogens, such as protozoa and helminths, or those sharing similar transmission modes, are also likely to show important commonalities and differences [72]. Our work points to a great need for more research to quantify and classify the factors that contribute to major classes of infectious disease outbreaks.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Jones K.E. Patel N.G. Levy M.A. Storeygard A. Balk D. Gittleman J.L. Daszak P. Global Trends in Emerging Infectious Diseases Nature 200845199099310.1038/nature 0653618288193 PMC 5960580 · doi ↗ · pubmed ↗
- 2Smith K.F. Goldberg M. Rosenthal S. Carlson L. Chen J. Chen C. Ramachandran S. Global Rise in Human Infectious Disease Outbreaks J. R. Soc. Interface 2014112014095010.1098/rsif.2014.095025401184 PMC 4223919 · doi ↗ · pubmed ↗
- 3Khorram-Manesh A. Goniewicz K. Burkle F.M. Unleashing the Global Potential of Public Health: A Framework for Future Pandemic Response J. Infect. Public Health 202417829510.1016/j.jiph.2023.10.03837992438 · doi ↗ · pubmed ↗
- 4World Health Organization (WHO) The Top 10 Causes of Death Available online: https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death(accessed on 31 January 2025)
- 5Disease and Urbanization 1st ed. Clegg E.J. Garlick J.P. Routledge Library Editions: Health, Disease, and Society Routledge New York, NY, USA 2022 Volume 11978-1-032-25318-3
- 6Neiderud C.-J. How Urbanization Affects the Epidemiology of Emerging Infectious Diseases Infect. Ecol. Epidemiol.201551910.3402/iee.v 5.27060 PMC 448104226112265 · doi ↗ · pubmed ↗
- 7Jones B.A. Grace D. Kock R. Alonso S. Rushton J. Said M.Y. Mc Keever D. Mutua F. Young J. Mc Dermott J. Zoonosis Emergence Linked to Agricultural Intensification and Environmental Change Proc. Natl. Acad. Sci. USA 20131108399840410.1073/pnas.120805911023671097 PMC 3666729 · doi ↗ · pubmed ↗
- 8Smith K.F. Sax D.F. Gaines S.D. Guernier V. Guégan J.-F. Globalization of Human Infectious Disease Ecology 2007881903191010.1890/06-1052.117824419 · doi ↗ · pubmed ↗
