The relationship between general practice characteristics, case-mix, and secondary care attendances/admissions before and after the COVID-19 pandemic: Protocol for an OpenSAFELY cohort study
Mengxuan Zou, Shrinkhala Dawadi, Luisa M Pettigrew, Rosalind M Eggo, Emily Herrett, Venexia Walker, Michael Marks, Jonathan Sterne, Alex Walker, Arina Tamborska, Jaidip Gill, John Macleod, Johnny Filipe, Heather Mah, Sebastian Bacon, Matt Curtis, Amir Mehrkar, Ruth Costello

TL;DR
This study examines how general practice features and patient characteristics affect hospital use before and after the pandemic in England.
Contribution
The study provides insights into how general practice and patient factors influence winter hospital pressures, including pre- and post-pandemic trends.
Findings
Practice characteristics and patient case-mix are linked to secondary care use patterns.
Pre- and post-pandemic data reveal changes in A&E attendances and hospital admissions.
Findings will inform health system planning and understanding of hospital demand drivers.
Abstract
Healthcare services in England experience increased pressure during winter months due to seasonal infectious diseases, increased multimorbidity, and fluctuating demand. Understanding how characteristics of general practices, and their registered patient case-mix contribute to secondary care use—particularly for Ambulatory Care Sensitive Conditions (ACSCs)—is essential for planning and resource allocation. Primary and secondary care activity also significantly changed during the COVID-19 pandemic, and not all activity-types have returned to pre-pandemic levels in the years since, making it critical to examine trends across both pre-and post-pandemic periods. OpenSAFELY-TPP was used to access linked electronic health record data, covering approximately 2,600 general practices (about 40% of all practices in England) and 26 million registered patients in England using TPP SystmOne software…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
|
|
|
| |
|---|---|---|---|
| Cohort start date | 1 Oct 2018 | 1 Oct 2022 | 1 Oct 2023 |
| Cohort end date | 28 Feb 2019 | 28 Feb 2023 | 29 Feb 2024 |
|
| |||
| Consultation provision rate:
|
| ||
| % of eligible patients who are vaccinated for:
|
| ||
| Patient list size; Practice geographic region; |
| ||
| % registered patients:
| |||
| Average Cambridge Multimorbidity score (CMS) per practice
|
| ||
|
| |||
| Weekly rate of A&E attendances per 100,000 registered patients at the practice |
| ||
| Weekly rate of hospital admissions per 100,000 registered patients at the practice | |||
|
| |||
| Weekly rate of A&E attendances for ACSCs per 100,000 registered patients at the practice | Same definition and structure as the primary outcomes, but will be restricted to the primary five Ambulatory Care Sensitive Conditions (ACSCs):
| ||
| Exposures | Definition | Data sources (see
|
|---|---|---|
|
| ||
| Practice geographic region | North East;
| Primary care |
| Practice list size | Total number of patients registered at the practice. | Primary care |
| Consultation provision rate | The monthly consultations at each practice, per 1000 registered patients, summed over 12 months prior to the flu & winter months. | Primary care |
|
| ||
| Sex | Proportion of registered patients by sex: male, female, unknown. | Primary care |
| Age | Proportion of patients in each age band: 0–5, 5–15, 65+, 75+, 85+ years, unknown. | Primary care |
| Ethnicity | Proportion of patients classified as White, Mixed, Asian or Asian British, Black or Black British, Chinese or Other, Unknown. | SNOMED |
| Deprivation | Proportion of patients living in each IMD deprivation quintile: 1 (most deprived), 2, 3, 4, 5 (least deprived), unknown; median IMD quintile of registered patients may also be considered | Index of multiple deprivation |
| Rurality | Proportion of patients living in each rurality classification: 1 (Urban major conurbation), 2 (Urban minor conurbation), 3 (Urban city and town), 4 (Urban city and town in a sparse setting), 5 (Rural town and fringe), 6 (Rural town and fringe in a sparse setting), 7 (Rural village and dispersed), 8 (Rural village and dispersed in a sparse setting), unknown. | Primary care |
| Current smoker proportion | Proportion of patients recorded as current smokers |
|
| Obesity proportion | Proportion of patients with a diagnosis of obesity or BMI >30 |
|
| Cambridge Multimorbidity Score (CMS) - mean CMS per patient in each practice |
| SNOMED |
| Vaccination - Flu | Proportion of eligible registered patients who received the vaccine in primary care for the year prior to flu & winter months | SNOMED flu vaccination codes |
| Vaccination - COVID-19 |
| SNOMED COVID-19 vaccination codes |
|
|
|
|---|---|
| Asthma | ICD10: asthma |
| COPD, including bronchitis | ICD10: COPD |
| Diabetes | ICD10: diabetes |
| Hypertension | ICD10: hypertension |
| Angina | ICD10: angina |
- —Wellcome Trust
- —NIHR Health and Social Care Delivery Research programme
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth disparities and outcomes · Global Health Care Issues · Climate Change and Health Impacts
Background and rationale
Primary and secondary care services in England experience increased pressures during the winter. Multiple factors can contribute to these pressures, including seasonal epidemics of respiratory infectious diseases such as COVID-19, influenza (flu), and respiratory syncytial virus (RSV) ^ 1 ^, increased cardiovascular disease incidence ^ 2, 3 ^ and the increasing healthcare needs of a growing and ageing population with multiple long-term conditions ^ 4 ^. However, despite the increasing demand, the capacity of primary care has not scaled correspondingly due to a decrease in qualified workforce numbers between 2015 and 2022 ^ 5 ^.
Pressures at the primary and secondary care levels are likely interrelated, with heightened primary care pressures and unmet demand having downstream consequences in secondary care, potentially significantly impacting resources, costs, and the quality of secondary care services. In England, unplanned hospital admissions account for 67% of hospital bed days and cost £12.5 billion annually (Department of Health, 2013) ^ 6 ^. Managing the demand for unplanned admissions has become a critical policy priority, with general practice playing a vital role in mitigating this burden.
Ambulatory Care Sensitive Conditions (ACSCs) are conditions for which timely and effective primary care can prevent hospital admissions, for example: asthma, diabetes, chronic obstructive pulmonary disease (COPD), hypertension, and angina ^ 7 ^. There is considerable variability in ACSC admission rates across general practices, even after accounting for patient demographics and chronic disease prevalence ^ 8, 9 ^. This suggests that practice-level factors, such as consultation provision rates, list size, region and rurality, and area-level deprivation may significantly influence secondary care utilisation ^ 10 ^.
Existing evidence highlights that practice-level characteristics (such as consultation provision rate) and patient case-mix (such as age), may be associated with variations in ACSCs admissions ^ 11, 12 ^. Describing the association between these practice characteristics and secondary care admissions is critical for developing strategies to manage demand and reduce pressures across the healthcare system.
Aim
To investigate the relationship between general practice-level characteristics, including practice characteristics, patient sociodemographics and case-mix, and registered patients' use of secondary care services, and if this varies over time by ACSC and patient age.
Research questions
1.Are there variations in weekly hospital attendance/admission across general practices during the flu and winter months (October - February) pre-pandemic (2018–2019) and post-pandemic (2022–2024)?2.Are general practice characteristics and registered patient sociodemographics & case-mix associated with the weekly rate of hospital attendance/admission during the flu and winter months? Have these associations changed over time in the pre-pandemic (2018–2019) and post-pandemic periods (2022–2024), or by patients’ age?3.Are the above characteristics and case-mix associated with the rate of hospital attendance/admissions for ACSCs (asthma, diabetes, COPD (including bronchitis), hypertension, and angina) vs non ACSCs during the flu and winter months, and have these associations changed over the pre-pandemic and post-pandemic periods, or by patients’ age?
Methods
Data sources
This study will use general practice electronic health records (EHRs) accessed via the OpenSAFELY-TPP platform. General practice data will be linked to the following data:
Secondary Uses Services (SUS)
- SUS Admitted Patient Care
- SUS Emergency Care Office of National Statistics (ONS) death registrySecond Generation Surveillance System (SGSS)Index of Multiple Deprivation (IMD) - patient-level
Study population
Our study population consists of approximately 2,600 general practices in England (with about 26 million registered patients) between 2018 and 2024. This represents about 40% of general practices in England, and 42.6% of England’s population ^ 13 ^. All practices providing EHR data will be included in the analysis (i.e. regardless of whether they close or merge during the study period), as this is a descriptive study aiming to provide snapshots of real-world patterns in general practice characteristics and case-mix, and secondary care attendances/admissions. Patients will be included if they are alive and registered in an English general practice using the EHR software provider TPP at the start of each study period.
Timeframes and cohorts
We will describe the data for the time periods before and after the COVID-19 pandemic, as the pandemic caused significant disruptions to general practice consultations ^ 14 ^ and secondary care attendances/admissions ^ 15, 16 ^:
Pre-pandemic: January 2018 to March 2020Post-pandemic: April 2022 to March 2024
We will assess the associations between general practice characteristics and secondary care admissions/attendances over 5 months, encompassing the flu and winter seasons (October to February, inclusive). Full details of the overall study periods for each cohort, including the specific timeframes for exposure and outcome variables, are provided in Table 1.
Exposure variable definitions
All the exposure variables will be regarded as time in-variant and assessed either on or before the flu and winter months (as detailed in Table 1), depending on the required observation timeframes. Definitions for each exposure variable are provided in Table 2, encompassing both practice-level characteristics and registered patient sociodemographic and clinical case-mix. A summary is provided below:
Practice-level characteristics consist of the region of practice location (nine regions defined using the International Territorial Level 1 region of England), registered patient list size, and consultation provision rate. Patient list size is defined at the start of the time period specified in Table 1, and is used as the denominator for all practice-level measures to facilitate comparisons across practices. Consultation provision rate will be calculated for the full year preceding the flu and winter months, to reflect overall practice activity.
Registered patient sociodemographic variables are calculated as proportions of the total registered patient population within each practice at the start of the flu and winter months. These include the proportion of:
Female patients (sex);Patients aged 0 to ≤5 years, 5 to ≤15 years, 65 to ≤75, 75 to ≤85, and ≥85 years (age);Patients classified as White, Mixed, Asian or Asian British, Black or Black British, Chinese or Other Ethnic Groups (ethnicity);Patients residing in different socioeconomically deprived quintiles, as defined by the Index of Multiple Deprivation (IMD; deprivation);Patients living in urban or rural areas, based on rurality classification.
Depending on the distribution of the data, deprivation may also be summarised using the median IMD quintile of registered patients within each practice. Similarly, rurality may be converted from a proportion-based measure to a binary categorical variable (urban/rural) if most patients within a practice share the same classification. The proportion of patients with missing data on sex, age, ethnicity, deprivation (IMD), or rurality will be reported to assess the completeness of sociodemographic information across practices.
Registered patient case-mix variables are also expressed as proportions of the total registered patients at each practice. These include:
The proportion of current smokersThe proportion of patients with a BMI >30 or a recorded diagnosis of obesity.The multimorbidity burden, which is quantified using the mean (or median, as appropriate) Cambridge Multimorbidity Score (CMS) per patient within each practice. The CMS is a validated, weighted index capturing disease burden based on the presence of multiple long-term conditions. It was derived using a large UK-based primary care cohort (n=300,000) and has been recently validated in two additional UK datasets ^ 17 ^.
Outcome variable definitions
This study will examine weekly and aggregated rates of secondary care use during flu and winter months (October to February) in relation to general practice characteristics and patient case-mix.
** Research questions 1 & 2 **
The primary outcomes for research questions 1 and 2 are the weekly rates of A&E attendances and hospital admissions during the flu and winter months (October to February). Aggregated winter-month rates may also be considered. Rates will be reported per 100,000 registered patients at the general practice and derived using:
A&E attendances: Emergency Care Data Set (ECDS)Hospital admissions: Admitted Patient Care (APC) dataset.
** Research question 3 **
The secondary outcomes, aligned with research question 3, are weekly rates of A&E attendance and hospital admissions specifically for selected ACSCs: asthma, diabetes, COPD (including bronchitis), hypertension, and angina ( Table 3). Aggregated winter-month rates will also be considered.
Statistical analysis
** Research question 1 **
Research question 1 will describe variation in hospital attendance and admission rates across practices. We will summarise general practice characteristics, including registered patient sociodemographics and case-mix. These characteristics will be described at the start of each flu and winter period. Median and interquartile ranges or means with standard deviations will be presented, as appropriate. Missing data will also be summarised for each exposure variable. Primary care consultation provision rates (defined in Table 1) will be presented overall and stratified by patient characteristics (e.g. age bands, sex, CMS quintiles) and practice-level factors (e.g. region).
Secondary care rates (defined in Table 1) will be described over the entire flu and winter months to illustrate patterns and variations over time. Rates for hospital admissions and A&E attendances will be presented separately; hospital admission rates will be further categorised by ACSC or non-ACSCs admissions.
We will additionally describe the distribution of general practice characteristics for practices with and without ACSC-related admissions. This output will illustrate how these characteristics differ by ACSC and non-ACSC admissions (see Table S1 in Extended data) ^ 18 ^.
** Research question 2 **
To address research question 2, we will examine the associations between general practice-level characteristics, registered patient case-mix, and the rates of A&E attendance and hospital admissions (see Figure S1 in Extended data) ^ 18 ^. The choice of regression model will be informed by the distribution and shape of the outcome data. We will initially explore the suitability of linear models and, if necessary, evaluate alternative model forms such as Poisson or negative binomial models. We will also account for the correlation structure of our longitudinal outcome data by using general estimating equations (GEE). The final choice of model form will be selected based on the data structure and goodness-of-fit testing. As a sensitivity analysis to the main analysis, we will fit a mixed effects model to our data, to capture general practice-specific variability (e.g. through random effects).
** Research question 3 **
To address research question 3, we will repeat the analyses described for research question 2 with ACSC-related hospital admissions and A&E attendances as outcomes, and separately for non-ACSCs. This will allow us to explore whether associations between practice characteristics and secondary care use differ for potentially avoidable admissions.
** Subgroup analysis **
We will conduct subgroup analyses restricted to practices with a high proportion of patients in specific age groups: 0 to ≤5 years; 5 to ≤15 years; 65 to ≤75 years; 75 to ≤85 years; and ≥85 years. High proportion will be defined empirically using the 95th percentile of the distribution across practices for each age group. These analyses will explore whether associations vary across practices with differing age compositions and will be conducted for the post-pandemic period only as more recent findings will have greater policy relevance for health system planning.
** Sensitivity analyses **
We will compare outcomes defined using our primary time window (October to February) with those based on the flu season as defined by the UK Health Security Agency (UKHSA), since the timing of flu activity can vary across years. This will allow us to evaluate whether our fixed study period adequately captures true seasonal flu activity, and whether any potential mismatch meaningfully affects observed rates of secondary care use.
We will also flag, and account for weeks affected by unstable external factors, such as extreme weather events or public holidays, which may influence secondary care use. Weeks with severe weather will be identified using the UK Met Office’s National Severe Weather Warning Service ( https://www.google.com/url?q=https://www.metoffice.gov.uk/research/library-and-archive/publications/national-severe-weather-warning-service).
Patient and Public Involvement and advisory input
We formed a Patient and Public Involvement and Engagement (PPIE) advisory group to provide feedback throughout the duration of this project. The group comprised seven people from across England with lived experience of accessing primary and secondary care services, including some members with multiple long-term conditions. We first met online in February 2025 to discuss this research. The advisory group members provided helpful input based on their experiences. For example, they suggested accounting for the impact of severe weather events and holiday weekends in our study, which we have now incorporated into our sensitivity analyses. We paid the members for their time, following NIHR guidelines.
Study status and dissemination
We are currently finalising data management and wrangling for this study. The study population has been defined, and all necessary variables have been created. Data checks are ongoing. We plan to begin the analysis described here during the week commencing 10 June 2025.
Data management has been carried out using ehrQL (Electronic Health Records Query Language; more information available at https://docs.opensafely.org/ehrql/), with analyses conducted using R version 4.5.0 and Stata version 19.0. All code is openly shared under an MIT open license to support transparency and reuse. Once the analyses described in this document are complete, all data management and analysis code will be archived at: https://github.com/opensafely/WinterPressuresDescriptive/. Clinical and medicines codelists used in this study are openly available at https://codelists.opensafely.org/.
Discussion
This protocol outlines a large-scale, population-based cohort study designed to investigate the relationship between general practice characteristics, registered patient case-mix, and secondary care attendances and admissions during winter months, both before and after the COVID-19 pandemic. Leveraging the OpenSAFELY-TPP platform, we will examine data from approximately 2,600 general practices across England, representing over 40% of all practices and nearly 43% of the national population. This provides a unique opportunity to explore real-world variation in healthcare use and to generate evidence that may inform service planning and policy. These insights may also help to identify modifiable practice-level drivers of high hospital use, particularly for ACSCs, where timely and effective primary care has the potential to reduce avoidable admissions.
As this is not a patient-level cohort analysis, we have treated all exposure variables as time-invariant, measured on or before the start of each winter period. This approach reflects the assumption that key practice-level characteristics — such as patient age structure, multimorbidity burden, or smoking prevalence — are unlikely to change substantially in the short term. This allows for consistent comparisons across flu seasons while capturing meaningful differences in practice context and case-mix.
Although consultation provision rate is a practice-level measure, it is derived from patient-level data, and the composition of registered patients can fluctuate throughout the year as individuals join or leave practices. To reflect underlying practice activity rather than short-term fluctuations, while ensuring a stable estimate of routine activity, we will calculate monthly consultation provision rates (defined as the number of consultations per 1,000 registered patients each month) and aggregate them over the 12 months prior to each winter period. This approach balances temporal accuracy and stability, providing a more representative estimate of usual practice workload, and avoiding bias from short-term variation or seasonal spikes in demand.
To maximise sample size and preserve representativeness, we have not applied a 90-day prior registration requirement for patients. While such thresholds are often used in patient-level studies to ensure completeness of individual records following registration, our focus is on practice-level characteristics and demand. Excluding patients based on short registration durations could omit relevant workload pressures — particularly for times or areas of high population movement — and is therefore not necessary for this descriptive practice-level analysis.
A key strength of this study is the inclusion of both pre-pandemic and post-pandemic time periods. We have chosen to define the post-pandemic period as April 2022. Although the World Health Organization officially declared the end of the global COVID-19 public health emergency in May 2023, our earlier start date better reflects the situation in the UK. From February 2022, health services in the UK adopted a “living with COVID-19” policy, and service delivery in both general practice and hospitals began operating under more routine conditions ^ 19 ^. The COVID-19 pandemic may lead to substantial and lasting changes in both primary and secondary care ^ 20 ^. Defining one cohort for each post-pandemic year allows us to better describe and compare how the relationship between general practice context and secondary care use may have changed across the post-pandemic era. These findings will support a better understanding of how structural and case-mix factors contribute to winter pressures and inform future health service planning.
We have incorporated feedback on study design from our PPIE advisory group, for example, identifying relevant exposures and sensitivity analyses (such as accounting for extreme weather or holiday weeks). This feedback has helped ensure that our research questions and analytic approach are grounded in real-world experiences, and that our findings will be applicable to health policy and clinical practice. We will continue consulting with our advisory groups throughout the study.
In summary, we have outlined a protocol to explore the characterisation of variation in winter-time secondary care use across English general practices and identify potential practice-level drivers of higher or lower service demand. The results may support future interventions aimed at improving integration between primary and secondary care, reducing avoidable hospital use, and enhancing preparedness for seasonal healthcare pressures.
Ethics and information governance
Ethics approval received from HRA and Health and Care Research Wales (HCRW) on 04 December 2024, REC reference: 24/HRA/5020.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Health and social care: winter update - Hansard - UK parliament.2025; [cited 2025 Jan 27]. Reference Source
- 2Azam S Jones T Wood S : Improving winter health and well-being and reducing winter pressures. A preventative approach. Public Health Wales, Cardif,2019;62. Reference Source
- 3Stewart S Keates AK Redfern A : Seasonal variations in cardiovascular disease. Nat Rev Cardiol. 2017;14(11):654–64. 10.1038/nrcardio.2017.76 28518176 · doi ↗ · pubmed ↗
- 4Mc Kee M Dunnell K Anderson M : The changing health needs of the UK population. Lancet. 2021;397(10288):1979–91. 10.1016/S 0140-6736(21)00229-4 33965065 PMC 9751760 · doi ↗ · pubmed ↗
- 5Pettigrew LM Petersen I Mays N : The changing shape of English general practice: a retrospective longitudinal study using national datasets describing trends in organisational structure, workforce and recorded appointments. BMJ Open. 2024;14(8): e 081535. 10.1136/bmjopen-2023-081535 39227175 PMC 11404227 · doi ↗ · pubmed ↗
- 6Busby J Purdy S Hollingworth W : Opportunities for primary care to reduce hospital admissions: a cross-sectional study of geographical variation. Br J Gen Pract. 2017;67(654):e 20–8. 10.3399/bjgp 16X 687949 27777230 PMC 5198609 · doi ↗ · pubmed ↗
- 7Nuffield Trust: Potentially preventable emergency admissions. [cited 2025 Jan 27]. Reference Source
- 8Busby J Purdy S Hollingworth W : A systematic review of the magnitude and cause of geographic variation in unplanned hospital admission rates and length of stay for Ambulatory Care Sensitive Conditions. BMC Health Serv Res. 2015;15(1): 324. 10.1186/s 12913-015-0964-3 26268576 PMC 4535775 · doi ↗ · pubmed ↗
