The role of social determinants in COVID-19 hospitalization disparities by migration status in Stockholm, Sweden. A population-based cohort study
Yan Ma, Anders Ledberg, Siddartha Aradhya, Sol P. Juárez

TL;DR
This study finds that social and health factors only partly explain why immigrants in Sweden had higher risks of being hospitalized with COVID-19 compared to Swedish-born people.
Contribution
The study quantifies how much social and health factors contribute to hospitalization disparities among immigrant groups in Sweden.
Findings
Immigrants from Africa and the Middle East had higher hospitalization risks compared to Swedish-born individuals.
Social and health factors explained only a small portion of the disparities for the most affected immigrant groups.
Occupation type and residential area were significant contributors to the disparities.
Abstract
Immigrants in Sweden, particularly those from low- and middle-income countries, had higher risks of COVID-19 mortality and morbidity compared to the Swedish-born. However, prior studies have not quantified the contribution of the differential distribution of health and social determinants to the increased risks. We used total population registers from Sweden to investigate disparities in COVID-19 hospitalization between five groups of immigrants and Swedish-born, using a cohort 577911 working-age adults (18–65 years) living in Stockholm during the first two waves of the COVID-19 pandemic. Applying a decomposition analysis, we quantified the relative contribution of age, sex, income, education, occupation type, residential area, and pre-existing medical conditions to these disparities. Our study shows that immigrants have higher risks of hospitalization compared to Swedish-born, and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMigration, Health and Trauma · COVID-19 and Mental Health · Racial and Ethnic Identity Research
Introduction
Immigrants (i.e., foreign-born) in many countries were disproportionately affected by the COVID-19 pandemic^1–4^. In Sweden, immigrants, particularly from low- and middle-income countries, such as Iran, Iraq, and Somalia, had significantly higher mortality rates and more severe complications during the pandemic compared to the Swedish-born^5–8^. The greater impact of the pandemic on immigrants, both in Sweden and other countries, was in stark contrast to their often lower mortality compared to native populations prior to the pandemic^9–12^.
Differences in COVID-19 outcomes between immigrants and the Swedish-born have been reported for both working- and retirement-age populations, with higher risk ratios among the working-age population^8^. These differences remained after adjusting for health and socio-demographic variables—factors likely reflecting a mixture of differential exposure and susceptibility^5,8,10^. However, these studies did not quantify the relative contribution of each factor, nor did they examine in detail the possibility that the same set of health and social factors may impact the risk differently for different immigrant groups. In other words, previous studies have not completely acknowledged the heterogeneity of the immigrant groups in Sweden, as they implicitly assume that all socio-demographic variables have the same “effects” across all origins.
The aim of this study is to apply a decomposition analysis to quantify the contributions of several factors to COVID-19 hospitalization risk differences between immigrants and the Swedish-born population residing in Stockholm Municipality during the first two waves of the pandemic (March 2020 to January 2021). We focus on the working-age group since the relative risk is higher in this group, and moreover, occupation can be used as an indicator for COVID-19 exposure in this group. Overall, this study shows that immigrants had higher risks of COVID-19–related hospitalization compared to the Swedish-born, with large unexplained disparities remaining for groups from Africa and the Middle East. These findings highlight the need for efforts to identify the determinants underlying the differences in hospitalizations risks between these groups and the Swedish-born.
Methods
Study population and data
This study focuses on the first two waves of the COVID-19 pandemic, prior to the mass vaccination period. Here we consider the first wave (Wave 1) from March 1, 2020 to August 31, 2020, and the second wave (Wave 2) from September 1, 2020 to January 31, 2021.
The study population comprised all working-age individuals (18–65 years old in 2020) who were alive and had a registered residential address in Stockholm Municipality as of 2019, totaling 640,207 individuals. We excluded 15,650 individuals with missing records of country of birth. To ensure that pre-existing medical conditions were measured over the same period of time for all individuals, we included only those who had continuously resided in Sweden for at least five years before the onset of the pandemic (2015–2019). This criterion led to the exclusion of an additional 46,065 individuals, among whom 64 and 56 were hospitalized due to COVID-19 in Wave 1 and Wave 2, respectively—corresponding to lower hospitalization rates compared to the remaining population. Individuals who died before the start of the pandemic were excluded (150). We also excluded individuals who died from COVID-19 without prior hospital admission or from other causes during the respective wave (431 in Wave 1, and further 310 in Wave 2 after excluding 64 people died in wave 1, out of which 20 and 3 died from COVID-19 without prior hospital admission). After applying these exclusion criteria, the final study population comprised 577,911 individuals for Wave 1 and 577,537 for Wave 2.
Data were obtained from multiple national registers linked via unique pseudo-anonymized identification numbers. Data on mortality, including dates and causes, were retrieved from the Cause of Death Register (CDR). Data on inpatient care (hospitalizations), with dates and diagnoses, were obtained from the National Patient Registry (NPR). Data on individual sociodemographic background, including year of birth, sex, and residential areas, and country/region of birth was obtained from the Total Population Register from 2019. Data on occupation, disposable income, education attainment were retrieved from the Longitudinal Integrated Database for Health Insurance and Labor Market Studies (LISA).
Outcome
COVID-19 hospitalization (hospitalized vs. not hospitalized) was identified through the main diagnosis recorded in the National Patient Register (NPR) with the corresponding ICD codes: U07.1, and U07.2.
Migration status
Immigration status was defined by the country of birth, and was grouped into seven geographic countries/regions: Sweden, Other Nordic Countries (Finland, Denmark, Norway, and Iceland excluding Sweden), Other European Countries (including Russia), Middle East (including Turkey), Africa, and Rest of the World. Hereafter, the names of regional groups (e.g., “Other Nordic Countries”) refer to immigrants from the countries included in each respective group, unless stated otherwise. Countries/regions within each regional group can be found in Supplementary Table S1.
Covariates
Age was calculated as the difference between 2020 and the individual’s birth year. Pre-existing medical conditions associated with severe COVID-19 outcomes^13^ were assessed using information from the National Patient Registry. In particular we constructed a binary variable indicating whether an individual had at least one of the following main diagnosis recorded in within five years prior to the pandemic (January 1, 2015 – December 31, 2019): chronic kidney disease (ICD-10 code: N18), diabetes (E08–E13), cardiovascular diseases (CVD: E78, G45–G46, I10–I13, I20–I26, I63–I66, I80–I82), neurological diseases (G20, G21, G30, G35–G37), chronic respiratory diseases (J40–J47), tuberculosis (A15–A19), HIV (B20–B24), chronic liver disease (K70–K77), cancer (C00–C96) and sickle cell disease or thalassemia (D56–D59). Occupation type from 2019 was classified into three categories, “essential occupations”, “non-essential occupation”, “unemployed or unregistered”. Essential occupations were defined as in Billingsley et al.^14^, and included care workers, police officers and security guards, service sector personnel, delivery workers, taxi- and bus drivers, teachers, meat packers, and cleaners (based on Swedish standard SSYK 2012, codes can be found in Supplementary Table S2). Individuals with a registered occupation that was not classified as essential were categorized as “non-essential workers.” The “unemployed or unregistered” category included individuals who did not have an occupation registered in the system, including not only unemployed individuals but also those working in unreported jobs or informal employees, such as part-time employees in restaurants. Disposable income from 2019 was divided into tertiles: low-, medium-, and high-income levels. Education attainment (according to Swedish Education Classification SUN2020) was classified into primary education (9 or fewer years of education), upper secondary education (10–12 years of education), post-secondary education ( > 12 years of education) and missing (individuals with missing educational attainment). As an indicator of area of residence, we used “District”, a nationwide geographical division corresponding to the parish division as on December 31, 1999. The municipality of Stockholm has 28 districts with areas ranging from 0.56 km^2^ to 17.63 km^2^, and populations from 4584 to 59154 inhabitants. Given that the outcome we study is relatively rare, a tessellation with smaller geographical regions was deemed not meaningful. To determine which district someone resided in, we used information on which demographic statistical areas, DeSO (demografiska statistikområden in Swedish) someone resided in.
Statistical analyses
Hospitalization risk was taken as the number of hospitalized people divided by corresponding population in the beginning of each wave. To examine the association between country/region of birth and COVID-19 hospitalization, we estimated both unadjusted and adjusted odds ratios (ORs) using logistic regression. The regression models assessed whether being an immigrant from a specific country or region was associated with risk of hospitalization, both without and with adjustments for all covariates of interest. Given that hospitalization risks are small (in the order of less than 1%), we used odds and risks synonymously in the remaining.
To quantify the individual contribution of the factors to the hospitalization risk difference, we applied a decomposition analysis. This method is an extension of the Oaxaca-Blinder decomposition^15,16^ to nonlinear models^17^, including logit and probit models, making it suitable for the binary outcomes studied herein. The aim of this decomposition is to explain the hospitalization risk difference by decomposing it into two components: the composition effect, accounting for differential distribution of the measured factors, and; the coefficient effect, accounting for the remaining risk difference. The interest lies in the composition effect which quantifies how much of the hospitalization risk differences can be attributed to differences in how the measured factors are distributed between immigrant groups and the Swedish-born. For example, age is a major risk factor for complications of COVID infections, and the composition effect quantifies the extent to which differences in age structures between two groups lead to differences in hospitalization risks. Intuitively, a large coefficient effect means either that the model is incomplete or that different factors have different effects in different groups. However, the contribution of each factor to coefficient effect cannot be quantified due to the nonlinear link function in the logistic regression^17^. To still get some insight into this issue, we fitted logistic regression models to data for each group separately, of which the results are shown in Supplementary file and briefly discussed in the next section. To ensure a robust estimate of coefficients to calculate composition effect, we used pooled coefficients, i.e., coefficients from logistic regressions including all study population with dummy variables for regional groups. By doing so, we assume the pooled coefficients are the reference coefficients or a nondiscriminatory condition where all population following the same effects on included factors. Details of this method are described in the Supplementary file.
All factors added into the analysis were quantified or categorized as described previously, except for a higher-order term of age (age^2^) that was also added into logistic regression and later decomposition analysis to account for non-linearity in the relationship between age and hospitalization.
Data management was performed using Python 3.12.1. Logistic regression and Fairlie decomposition analysis were conducted in STATA MP 15.1 using the Fairlie software package^18^.
Ethics and inclusion statement
The study has received approval from the Regional Ethical Review Board of Stockholm, that also waived the need for informed consent due to the anonymized nature of the data material (decision no. 2022-00428-01). The data were processed in accordance with the General Data Protection Regulation and the research was performed in accordance with the relevant guidelines and regulations (Declaration of Helsinki).
Results
COVID-19 hospitalization risk and associated factors by country/region of birth and epidemic wave
Table 1 presents the COVID-19 hospitalization risk and the distribution of covariates across groups defined by country/region of birth. Figure 1 shows the hospitalization risk across districts. Hospitalization risk was higher among all immigrant groups compared to the Swedish-born, in both waves. In Wave 1, the Swedish-born had a hospitalization risk of 0.14%. Among immigrants, the highest risk was observed in Middle East (0.79%), followed by Africa (0.68%), Rest of the World (0.49%), Other Nordic Countries (0.32%), and Other European Countries (0.21%). In Wave 2, hospitalization risk decreased for all groups compared to Wave 1, but Middle East continued to have the highest risk (0.48%). These large differences in hospitalization are much greater than for infectious and parasitic diseases or for respiratory system diagnoses (excluding COVID-19) before and during the pandemic (Supplementary Fig. S1).Fig. 1. Percentage of immigrants and hospitalization risk by district and wave.Percentage of immigrants in each district (a); hospitalization risk (in %) in each district for (b) Wave 1 and (c) Wave 2. Districts: 1. Bromma; 2. Brännkyrka; 3. Enskede; 4. Essinge; 5. Farsta; 6. Hägersten; 7. Hässelby; 8. Högalid; 9. Kista; 10. Kungsholmen; 11. Skarpnäck; 12. Skärholmen; 13. Spånga; 14. Stockholms Adolf Fredrik; 15. Stockholms Engelbrekt; 16. Stockholms Gustav Vasa; 17. Stockholms Hedvig Eleonora; 18. Stockholms Katarina; 19. Stockholms Maria Magdalena; 20. Stockholms Oscar; 21. Stockholms Sankt Göran; 22. Stockholms Sankt Johannes; 23. Stockholms Sankt Matteus; 24. Stockholms Sofia; 25. Stockholms domkyrkodistrikt; 26. Vantör; 27. Vällingby; 28. Västerled.Table 1. Hospitalization risks (Stockholm, March 1, 2020 to January 31, 2021) and descriptive statistics for the study population (Wave 1) from different country/region of birth. Except population and age, all values are in percentageCharacteristicsAll PopulationSwedenAfricaMiddle EastOther Nordic CountriesOther European CountriesRest of the WorldPopulation577,911438,21824,44236,22410,21434,59034,223Person-time, total amount (average)289 096(0.5002)219,362(0.5006)12.184(0.4985)18,049(0.4983)5106(0.5000)17,304(0.5003)17,090(0.4994)Percentage of Total Population75.84.26.31.86.05.9Hospitalization Incidence in Percentage Wave 10.230.140.680.790.320.210.49 Wave 20.150.090.290.480.250.180.30Sex Female50.450.148.846.461.852.254.3 Male49.649.951.353.638.247.845.7Age Mean41.040.242.444.249.143.042.3 SD12.913.111.712.012.211.611.3Pre-existing Medical Conditions No90.791.088.587.885.192.290.5 Yes9.49.011.512.214.97.89.5Occupation Type Essential20.217.144.930.717.921.030.1 Non-essential57.763.224.134.462.350.742.4 Unemployed or Unregistered22.119.731.034.919.928.327.6Income Level Low33.330.148.747.327.640.644.1 Medium33.332.639.335.233.833.536.7 High33.337.412.017.538.725.919.3Education Level Primary10.98.927.423.07.48.515.3 Upper secondary31.731.642.234.828.926.030.0 Post secondary56.059.027.240.361.156.951.3 Missing1.40.53.21.92.68.63.4District Bromma5.55.92.33.56.04.94.8 Brännkyrka5.55.64.05.05.05.36.0 Enskede4.65.12.42.24.53.23.8 Essinge1.01.20.20.31.30.90.7 Farsta6.05.96.65.55.36.28.1 Hägersten8.19.02.73.98.47.26.1 Hässelby3.83.17.17.44.05.44.8 Högalid3.43.90.71.03.72.42.1 Kista3.31.512.513.32.64.47.4 Kungsholmen2.12.40.40.72.21.51.2 Skarpnäck5.25.53.82.65.75.04.9 Skärholmen3.62.09.012.92.17.17.3 Spånga5.33.127.5153.96.26.6 Stockholms Adolf Fredrik1.01.10.20.51.00.90.6 Stockholms Engelbrekt3.03.20.71.33.43.72.7 Stockholms Gustav Vasa1.61.80.30.61.71.40.9 Stockholms Hedvig Eleonora1.11.30.20.41.20.90.5 Stockholms Katarina3.94.51.21.34.32.72.5 Stockholms Maria Magdalena2.12.40.50.62.41.51.4 Stockholms Oscar3.94.50.51.24.02.92.1 Stockholms Sankt Göran3.55.21.12.75.33.93.4 Stockholms Sankt Johannes1.31.40.30.61.21.30.9 Stockholms Sankt Matteus3.54.00.51.23.62.82.1 Stockholms Sofia4.24.70.71.44.93.32.7 Stockholms domkyrkodistrikt0.50.60.10.30.60.60.4 Vantör5.54.78.66.54.78.39.7 Vällingby3.83.25.77.63.64.45.1 Västerled2.93.40.50.73.32.11.3Descriptive data corresponding to Wave 2 can be found in the Supplementary Information Table S3.
Statistics in Table 1 show that the measured factors are differentially distributed across immigrant groups compared to the Swedish-born. Notably, some immigrant groups have a much higher percentage of essential workers than the Swedish-born population, and also show higher residential clustering. For example, 44.9% of immigrants categorized in the group Africa have occupation belonging to the category of essential workers, compared to 17.1% among the Swedish-born, and almost half of them resided in three out of 28 districts (Spånga, Kista, and Skärholmen), together accounting for 12.4% of the total population. These differential distributions justified the use of decomposition analysis to quantify the composition effect.
Association analysis of COVID-19 hospitalization between immigrants by region of birth vs Swedish-born
Table 2 shows the estimates derived from logistic regression models fitted to examine how the association between COVID-19 hospitalization and country/region of birth changes when adjusting for the factors presented in Table 1. First, the unadjusted models show that the odds ratios comparing immigrant groups to the Swedish-born were all statistically significant, for both waves. For example, immigrants from Middle East had more than five times higher risk of hospitalization during Wave 1 compared to the Swedish-born (Table 2). Second, after adjusting for all included factors, the odds ratios for immigrant groups all decreased, but almost all remained significantly increased compared to the Swedish-born. In addition, the adjusted models show that older age, male sex (Wave 1), pre-existing medical conditions, essential workers, unemployed or unregistered occupation (Wave 2), high income (Wave 2), low education level, and residence in certain districts (Wave 1) is significantly associated with higher odds of COVID-19 hospitalization.Table 2. Estimates from univariable and multivariable logistic regression models from relevant factors by waveFactorsWave 1Wave 2Univariable OR (95% CI)Multivariable OR (95% CI)Univariable OR (95% CI)MultivariableOR (95% CI)Country/region of Birth SwedenReferenceReferenceReferenceReference Africa5.00***(4.21, 5.95)2.76***(2.26, 3.37)3.11***(2.42, 4.00)2.34***(1.77, 3.09) Middle East5.80***(5.04, 6.69)3.17***(2.69, 3.75)5.05***(4.23, 6.03)3.48***(2.83, 4.29) Other Nordic Countries2.37***(1.67, 3.37)1.48*(1.04, 2.11)2.69***(1.81, 4.00)1.63*(1.09, 2.43) Other European Countries1.57***(1.23, 2.00)1.23(0.96, 1.59)1.89***(1.45, 2.47)1.70***(1.29, 2.24) The Rest3.61***(3.04, 4.29)2.76***(2.30, 3.32)3.12***(2.51, 3.87)2.73***(2.16, 3.43)Age Age1.13***(1.08, 1.18)1.13***(1.07, 1.19) Age^21.00**(1.00, 1.00)1.00*(1.00, 1.00)Sex FemaleReferenceReference Male1.77***(1.58, 1.98)1.50***(1.30, 1.73)Pre-existing Medical Conditions NoReferenceReference Yes2.40***(2.11, 2.72)2.31***(1.98, 2.70)Occupation Type Not FrontlineReferenceReference Frontline1.52***(1.31, 1.75)1.36**(1.14, 1.63) Missing1.17(0.99, 1.37)1.26*(1.04, 1.54)Income Level HighReferenceReference Medium0.98(0.84, 1.14)0.95(0.79, 1.13) Low0.96(0.81, 1.13)0.81*(0.65, 1.00)Education Level Post secondaryReferenceReference Upper secondary1.23**(1.09, 1.40)1.18*(1.01, 1.39) Primary1.39***(1.17, 1.64)1.39**(1.13, 1.72) Missing1.50*(1.00, 2.26)0.96(0.51, 1.82)District BrommaReferenceReference Brännkyrka1.17(0.82, 1.69)1.07(0.72, 1.59) Enskede1.17(0.80, 1.73)0.92(0.59, 1.43) Essinge1.07(0.52, 2.17)0.27(0.06, 1.10) Farsta0.86(0.59, 1.25)1.11(0.76, 1.63) Hägersten1.22(0.86, 1.72)0.85(0.57, 1.27) Hässelby1.29(0.90, 1.85)0.77(0.49, 1.20) Högalid0.93(0.59, 1.48)0.72(0.42, 1.23) Kista1.32(0.93, 1.88)0.86(0.57, 1.32) Kungsholmen0.71(0.39, 1.31)0.79(0.42, 1.47) Skarpnäck1.07(0.73, 1.56)1.07(0.71, 1.60) Skärholmen1.63**(1.16, 2.29)0.98(0.66, 1.47) Spånga1.86***(1.35, 2.56)0.80(0.54, 1.19) Stockholms Adolf Fredrik2.01*(1.16, 3.50)0.79(0.34, 1.86) Stockholms Engelbrekt0.87(0.52, 1.46)0.49*(0.25, 0.97) Stockholms Gustav Vasa0.80(0.42, 1.54)0.65(0.30, 1.38) Stockholms Hedvig Eleonora1.04(0.53, 2.05)1.25(0.65, 2.43) Stockholms Katarina1.21(0.81, 1.81)0.79(0.49, 1.29) Stockholms Maria Magdalena0.93(0.55, 1.57)1.14(0.68, 1.93) Stockholms Oscar0.87(0.54, 1.39)0.75(0.44, 1.26) Stockholms Sankt Göran0.80(0.52, 1.25)0.71(0.44, 1.17) Stockholms Sankt Johannes1.26(0.70, 2.28)1.42(0.78, 2.59) Stockholms Sankt Matteus0.78(0.47, 1.29)0.83(0.49, 1.41) Stockholms Sofia0.67(0.41, 1.08)0.75(0.46, 1.23) Stockholms domkyrkodistrikt0.88(0.35, 2.21)0.58(0.18, 1.88) Vantör1.27(0.90, 1.80)0.94(0.63, 1.40) Vällingby1.18(0.81, 1.72)0.90(0.59, 1.39) Västerled1.37(0.89, 2.11)0.65(0.36, 1.17)All tests were two-sided. No adjustment was made for multiple comparisons.*p < 0.05; **p < 0.01; ***p < 0.001.
Decomposition of factors contributing to COVID-19 hospitalization disparities between immigrants by region of birth and Swedish-born
To quantify the contribution of each factor to the differences in risk between immigrants and Swedish-born, we next performed the decomposition analysis; the results are shown in Figs. 2, 3. Figure 2 shows the risk differences (gray bars on the left panels) and the proportion explained by the composition effect (blue bars on the right panels). For Wave 1, the largest composition effect was observed for Other European Countries (48%; percentages regarding the composition effects are given in Supplementary Table S4, S5) followed by Other Nordic Countries (44%). Despite having the highest risk differences, Middle East and Africa exhibit slightly lower explained percentages (18% and 21%, respectively). Only 12% of the difference observed in immigrants from Rest of the World is contributed from composition effect. In Wave 2, the risk differences decrease substantially for most groups, except for Other European Countries. The composition effects, meanwhile, decrease for all groups. These results show that the differences in the distribution of the seven included factors between Swedish-born and immigrant groups cannot fully explain the disparities in the risk of hospitalization, in particular for immigrants from outside Europe.Fig. 2. Total and explained share of hospitalization risk differences between Swedish-born individuals and immigrants.Hospitalization risk differences (percentages) between Swedish-born individuals and immigrants by region of birth, and the percentage of these differences attributed to the composition effect for (a) Wave 1 and (b) Wave 2. Data are available in Supplementary Information Tables S4, S5.Fig. 3. Contributions of factors to hospitalization risk differences.Contribution of each factor to the hospitalization risk difference (bars reflect estimated attributable percentages with 95% of confidence interval) between Swedish-born individuals (Wave 1: n = 438,218; Wave 2: n = 437,953) and immigrants categorized by region of birth: a Africa, b Middle East, c Other Nordic Countries, d Other European Countries and (e) the Rest of the World. All data are available in Supplementary Information Tables S4, S5.
Figure 3 shows the contributions of each factor to the risk differences. The seven studied factors (age, sex, pre-existing medical conditions, occupation type, income level, education level, residential district) contribute differently to hospitalization risk differences across immigrant groups. For immigrants from Other Nordic and Other European Countries, whose composition effects are relatively high, the main contributions come from age (Other Nordic Countries: 33% in Wave 1, 31% in Wave 2; Other European Countries: 16% in Wave 1, 11 % in Wave 2; percentages given in Supplementary Table S4, S5). This shows that the age structure of these two immigrant groups accounts for a substantial part of the risk difference. For Other Nordic Countries, another substantial contribution comes from larger proportion of pre-existing medical conditions (11% in Wave 1, 9% in Wave 2). For Other European Countries, additional substantial contributions come from occupation type (11% in Wave 1, 7% in Wave 2) and residential district in Wave 1 (22%). For the other three immigrant groups, the contributions from the seven factors are smaller, but some common patterns emerged. Occupation type contributes significantly to both waves (Africa: 5% in Wave 1, 7% in Wave 2; Middle East: 3% in Wave 1, 3% in Wave 2; Rest of the World: 4% in Wave 1, 4% in Wave 2) and residential district contribute significantly for Wave 1 (Africa: 9 %; Middle East: 7%; Rest of the World: 5%) but not for Wave 2. For all immigrant groups, the magnitude of contribution varies, but some common patterns are observed. Age and occupation type contribute significantly to all the groups for both waves. Residential district contributes positively to Wave 1 for all groups, but the contribution dramatically decreases and becomes non-significant in Wave 2. We also note that some previously studied factors, income and education, have limited or no significant contributions to the composition effect.
Since the composition effects are relatively small for groups with high differences in hospitalization risks, (e.g., Africa and Middle East), the coefficient effects will be high for these groups. To better understand the contributions to the coefficient effects, we ran logistic regression models for each group separately (Fig. S2 in the Supplementary file). The wide range of odds ratios shown in this figure suggests that a given factor may contribute to the hospitalization risk differently for the different groups. The coefficients for several residential district for Africa and Middle East are higher compared to the Swedish-born population, indicating that living in certain districts carries additional risks for immigrants compared to the Swedish-born. Another interesting thing to note is that the risk of people without education records is remarkably higher among the Swedish-born than among the overall population or immigrant groups. This may suggest that, despite being categorized similarly in terms of educational attainment, Swedish-born individuals without education records differ in important ways from their immigrant counterparts—differences that may contribute to their elevated risk.
Discussion
Our results show that immigrant groups had much higher risk of hospitalization for COVID-19 compared to the Swedish-born. The main aim of our study was to examine the extent to which the differences in the distribution of factors (group composition) explain these differential risks. The results showed that group composition contributed to varying degrees and with different patterns in different groups of immigrants. Overall, group composition contributed less to the (large) disparities in hospitalization among the most affected immigrant groups compared to the Swedish-born (21%, 16% for Africa and 18%, 11% for Middle East, in Wave 1 and Wave 2, respectively). In contrast, it contributed considerably more to the (small) disparities in hospitalization among Other European and Other Nordic Countries compared to the Swedish-born (48%, 44%, for Wave 1 and 13%, 39% for Wave 2, respectively). Assuming that occupation type and residential district primarily indicate exposure to the virus, and age and pre-existing medical conditions indicate susceptibility, our results suggest a combination of exposure and susceptibility explains differences for Other European Countries and non-European immigrants, while susceptibility plays a greater role for Nordic immigrants. In fact, for the latter group, pre-existing medical conditions explain more than forty percent of the hospitalization risk difference.
Our study examines whether the distribution of social determinants explains group differences. The decomposition analysis allowed us to explicitly estimate how the differential distributions of each factor contribute to the differences in hospitalization risk across groups. The small overall composition effect observed among immigrants from Africa and Middle East (i.e., from the origins most affected by the pandemic) means that the differential distribution of the included factors did not fully explain the risk differences. Such phenomenon does not only appear in Swedish context, but has been seen in studies conducted in other Nordic countries^19,20^. Therefore, further studies are needed to identify additional factors, such as social contacts, that may account for the differences in risks between immigrant groups and the local-born. Several aspects of social contacts are relevant for disease transmissions, not only the number of interactions but also the type^21–23^. Moreover, given the nonlinear nature of infectious disease outbreaks, even small differences in social contacts may play a major role in disease spread in communities^24–26^. Social contacts may also influence viral load, which has been shown to increase the risk of hospitalization^27^. However, these contacts are hard to measure at the population level, and it is not known if differences in the social contacts between immigrant groups and the Swedish-born can explain the differential disease transmission. This remains an important line of research for future studies.
Although most factors contribute relatively equally across both waves, residential district was a notable exception: it had a positive contribution for all groups in Wave 1 and none in Wave 2. This is an interesting finding, and is likely due to the geographical clustering of multiple factors increasing community transmission early in the pandemic. Indeed, the two most affected immigrant groups during the pandemic had a high proportion of essential workers, were clustered in a few residential districts, and were also more likely to live in multigenerational households^8^. This combination likely increased the likelihood of exposure and facilitated both within-household and community transmission, thereby raising hospitalization risks in these districts. The contributions of residential districts diminish during Wave 2, likely reflecting a more uniform spread of the virus in this wave, something also indicated by the distribution of hospitalization risks shown in Fig. 1.
The relatively small contribution of pre-existing medical conditions to differences in the risk of hospitalization among immigrants from Africa and Middle East compared to the Swedish-born is noteworthy. To our knowledge, this is the first study that includes also health conditions (HIV, tuberculosis, and sickle cell disease) related to severe COVID-19 disease, and known to be more prevalent among immigrants^28^. The fact that pre-existing medical conditions contribute only a small portion to differences in the risk of hospitalization may be explained by the fact that the health indicator we use is only a partial indicator of pre-existing medical conditions (i.e., only based on specialized out-patient and in-patient care). Overall, this result gives further support to the interpretation that susceptibility was not the main driver of the higher hospitalization risk among immigrants^28^. Regarding exposure, the factors we used do not include social contacts, likely to be of high importance for transmission. As such, the relatively small contribution of the factors indicative of exposure in our study cannot be used to rule out the possibility that differential exposure was a main driver.
Our study uses high-quality, population-based national register data, and, to the best of our knowledge, the first to link COVID-19 hospitalization outcomes with some key socioeconomic factors and pre-existing medical conditions while incorporating both individual and contextual risk factors. By doing so, we provide a comprehensive overview of the key determinants of hospitalization disparities between immigrants and the Swedish-born. Furthermore, while traditional logistic regression analyses can determine whether differences between groups persist after adjusting for covariates, they do not quantify the relative contribution of each factor to the overall differences between groups. In contrast, the decomposition analysis quantifies how much of the observed differences in hospitalization risks between immigrants and Swedes can be attributed to group composition. This quantification indirectly allows us to identify the contribution of each factor or determinant. Importantly, the decomposition analysis allows for examining the heterogeneity across immigrant groups, revealing how some determinants (such as occupation type and residential areas) can have a significant impact on hospitalization risk differences in certain immigrant groups, while others (such as income and education) have relatively little effect across all groups.
Our study also has limitations. First, our categorization of certain variables may have resulted in grouping individuals with different risks into the same category. In terms of occupation, for example, taxi driver and medical doctors are both categorized as essential occupations, but taxi drivers likely had less access to protective measures compared to medical doctors, thereby having a higher risk of being infected. Similarly, individuals without registered occupations, a category more prevalent among immigrants, may also consist of occupations with different risks. The same limitation may also apply to the categorization of education. For example, Swedish-born individuals classified in the “missing” category show remarkably higher risks compared to those in the same category for migrant groups (Fig. S2 in the Supplementary file). The higher percentage of individuals without education records among immigrants may partly reflect the extra steps involved in verifying or transferring foreign educational qualifications to Swedish registers. Therefore, many immigrants under this category may have actually completed some form of formal education. However, regarding Swedish-born individuals without education records, it is possible that a substantial proportion were unable to complete formal education due to various disabilities—disabilities that may also have increased their risk of severe COVID-19 complications. As a result, the exposure risks for the Swedish-born and immigrants under the same category can be largely different. These shortcomings could lead to a biased estimation of the composition effect. Second, the outcome used (hospitalization) reflects a combination of exposure and susceptibility, and given the available data, we cannot quantify the relative importance of the two with precision. Third, a common limitation of migration studies is the under-reporting of emigration to the authorities, which can lead to an artificial health advantage. However, the travel restrictions during the pandemic suggest that this limitation may be less problematic in our study. Finally, although concerns about misclassification in COVID-19 mortality among immigrants have been raised in studies conducted in Sweden^29^, we do not expect such misclassification to substantially affect hospitalization data. During the pandemic, individuals in hospitals were more likely to be tested, which reduced the likelihood of misclassification. However, we cannot rule out the possibility that certain barriers to healthcare access some immigrants may have led to an underestimation of hospitalizations. In any case, this limitation is unlikely to affect the overall conclusions of our study.
Although immigrants in other receiving contexts have also shown higher risks of COVID-19 mortality and morbidity, the results of our study should not be generalized to other settings. In fact, further studies should be conducted to determine whether similar patterns are observed in other national contexts and among other immigrant groups. Such as research would further contribute to understanding the contribution of social determinants to COVID-19 disparities by nativity.
Overall, immigrant background has been shown to be an important risk factor for COVID-19 mortality and morbidity. Our study shows that the increased risk can only partly be accounted for by the differential distribution of the observed social determinants often used in public health research.
Supplementary information
Transparent Peer Review file Supplementary material Description of Additional Supplementary Files Supplementary Data 1 Supplementary Data 2
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hintermeier, M. et al. COVID-19 among migrants, refugees, and internally displaced persons: systematic review, meta-analysis and qualitative synthesis of the global empirical literature. e Clinical Medicine 74, (2024).10.1016/j.eclinm.2024.102698 PMC 1170148439764176 · doi ↗ · pubmed ↗
- 2Jann, B. FAIRLIE: Stata module to generate nonlinear decomposition of binary outcome differentials. Boston College Department of Economics (2006).
- 3Liu, Y. et al. What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization. e Clinical Medicine 22, (2020).10.1016/j.eclinm.2020.100354 PMC 716529532313879 · doi ↗ · pubmed ↗
- 4Ma, Y. Python and STATA scripts for the role of social determinants in COVID-19 hospitalization disparities by migration status in Stockholm, Sweden. Zenodo. 10.5281/zenodo.17570578 (2025).10.1038/s 43856-025-01357-w PMC 1288699541540120 · doi ↗ · pubmed ↗
