Developing patient safety scale for hospitals
Metin DİNÇER, Esra KARATAŞ OKYAY, Yunus Emre KARATAŞ, Erol GÖRAL

TL;DR
This study developed a reliable and valid scale to assess and track patient safety culture in hospitals.
Contribution
The novel contribution is a validated patient safety scale with seven subscales for hospital settings.
Findings
The scale demonstrated good model fit and reliability with a Cronbach’s alpha of 0.921.
Seven subscales were identified, including teamwork, leadership, and error response.
The scale is valid and reliable for measuring patient safety culture among healthcare professionals.
Abstract
The study aimed to contribute to the literature with a reliable and valid scale for hospitals to be used in determining the current patient safety culture and following up on its development. The study was conducted with the participation of 1137 healthcare professionals selected using the convenience sampling method in 3 secondary-care state hospitals and three research and training hospitals, one of which was affiliated with a medical faculty, and two were affiliated with the Health Sciences University. To begin with, to discover the latent structure of the items on the scale, an Exploratory Factor Analysis (EFA) was performed. Additionally, to determine the factor structure of the scale, the Confirmatory Factor Analysis (CFA) method was used. The Cronbach’s alpha coefficient was calculated to check the reliability of the responses. According to Kaiser-Meyer-Olkin (KMO = 0.924)…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPatient Safety and Medication Errors · Medical Malpractice and Liability Issues · Quality and Safety in Healthcare
1. Introduction
Unsafe healthcare practices cause 3 million deaths worldwide per year, as well as costing 606 billion in developed countries [[1](#b1-tjmed-54-02-449)]. In the United States, patient safety violations are reported to increase the length of hospital stay by an estimated 2.4 million days, with an additional cost of 9.3 billion (£7.3 billion; €8.2 billion) [2]. It is estimated that 43.5% of patient safety-related adverse events occurring in the hospital could have been prevented [3]. In hospitals in low- and middle-income countries, 134 million adverse events occur each year, resulting in over 2.5 million lost lives [4]. While 10% of patients in developed countries are likely to be harmed, this rate is estimated to be 18% in African and Eastern Mediterranean countries [5].
Strategies for patient safety based on the principle of protecting patients from preventable harm, i.e. first do no harm [6], are a collective product of individual work and teamwork determined by the attitudes, perceptions, competencies, and behavioral patterns of employees [7]. It is, therefore, necessary to create a patient safety culture that identifies potential errors that could harm patients, consistently reduces the occurrence of errors, and learns from these errors [8]. The most important factors in terms of a patient safety culture are management support and leadership because, without these, a patient safety culture cannot be created [9]. Other topics that will contribute positively to a patient safety culture and that top management should lead are encouraging teamwork, creating working conditions to reduce errors, improving organizational development and communication, not treating errors with a punitive approach, and establishing an appropriate error reporting system [10–16].
In this study, it was aimed to develop a new scale because patient safety remains an essential public health problem [17], developing countries are more prone to patient safety violations [5], and the view of patient safety is different in developed countries due to the fact that patient safety practices have started earlier [18], there is a need for the development and learning of a safety culture in organizations depending on individual, group, and organizational factors and time for this [19], and there is a need for a scale developed within an existing organizational culture to measure the patient safety culture of healthcare professionals as a team. It is seen that the patient safety scales that are currently being used in Türkiye are adapted [20, 21] or focused on specific healthcare professionals such as nurses [22]. For these reasons, there is a need to develop a new instrument to measure the approach of healthcare professionals, to a patient safety culture as a team. Additionally, the scale developed in this study can be used by healthcare organizations that are currently working on the creation of a patient safety culture.
2. Materials and methods
2.1. Ethical aspects of the study
Ethical approval for the study was obtained from Yıldırım Beyazıt University Health Sciences Ethics Board, with the decision dated 07.04. 2022 and numbered 06. The general permission for the institutions where the study would conduct was received from the General Directorate of Health of Public Hospitals under the Turkish Ministry of Health with the decision dated 07.07.2022 and numbered E-32693113-622.03-03-552. This permission was then sent to institutions in the province of Ankara by the letter of the Ankara Provincial Directorate of Directorate dated 06.09.2022 and numbered E-90739940-799-1921. Finally, written permissions for the study were obtained from the hospitals to be included in the study.
The hospitals to be included in the study were selected using the convenience sampling method. Interviews were held with the managers of hospitals, and those who agreed for their institutions to be included were included in the study. The three secondary-care state hospitals which were included in the study have bed capacities of 100, 106, and 250. Of the three research and training hospitals included in the study, the two affiliated with the Health Sciences University have bed capacities of 418 and 760, and the one affiliated with the Medical Faculty of a university in Ankara has a capacity of 264 beds. Healthcare professionals who would participate in the study were selected using the convenience sampling method. The draft scales used in the study were printed on paper and delivered to the management teams of the hospitals, who distributed them to their employees who agreed to participate, collected them back after the participants filled them out, and delivered them to the researchers.
2.2. Data collection instrument
The scales used in the literature and items on these scales were examined. Additionally, the “Quality Standards in Health for Hospitals” issued by the General Directorate of Healthcare Services [23] and the “Standards of Accreditation in Health Hospital Kit” [24] issued by the “Türkiye Healthcare Quality and Accreditation Institute” were analyzed. A 73-item draft item pool under ten subscales was created for the draft Patient Safety Scale for Hospitals. The subscales and the numbers of items under these subscales were: 1-Teamwork, seven items; 2-Working Conditions, twelve items; 3-Organizational Development, five items; 4-Response to Error, four items; 5-Upper Management Support, ten items; 6-Error Reporting, seven items, 7-Communication, twelve items, 8-Patient Transfer and Shift Change, three items, 9-Job Satisfaction, eight items, and 10-Perceived Stress, five items. The draft scale was created as a 5-point Likert-type scale, and the response options of each item were “1- Strongly Disagree”, “2- Disagree”, “3- Undecided”, “4- Agree”, and “5- Strongly Agree.”
The draft scale was reviewed by 12 field experts. While reviewing, the experts were asked to choose one of the four options: “Not Suitable”, “Partially Suitable”, “Quite Suitable”, and “Very Suitable.” If they chose “Not Suitable” or “Partially Suitable”, they were asked to justify their decision to do so. The draft scale was also evaluated by a Turkish language expert. In the next phase, the draft scale was applied in a pilot implementation to 35 healthcare professionals working in a tertiary hospital and a secondary state hospital. Following the pilot implementation, necessary adjustments were made to the draft scale, and considering the number of items on the scale and the expected required time for the respondent to fill out the scale, nine subscales were included. These subscales were: 1-Teamwork, seven items; 2-Personnel, five items; 3-Organizational Learning, four items; 4-Response to Error, six items; 6-Error Reporting, four items; 6-Communication Related to Error, four items; 7-Upper Management Support and Leadership, eight items; 8-Important Patient Information, three items, and 9-Working Environment, three items. The revised draft scale was assessed by a Turkish language expert, and expert opinions were taken into consideration. The study was conducted with this final form of the draft scale.
2.3. Limitations
It is assumed that the answers given by the participants in the study were sincere and accurate. The participants of the study were limited to physicians, nurses, midwives, emergency medical technicians, anesthesia technicians, laboratory technicians, and radiology technicians working in the included hospitals.
2.4. Data analysis
For testing the construct validity of the scale, the exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) methods were used. EFA is a multivariate statistical method that aims to determine multiple interrelated variables and fewer theoretically significant latent variables [25]. It is a type of validity test used in determining the number of factors in scale development studies. The draft scale was filled out by 1180 healthcare professionals, missing data were checked, the data of the participants who did not respond to all items on the scale and those who gave the same response to all items were excluded from the analyses, and 1137 participants in total completely responded to the scale. Among the healthcare professionals who participated in the study, 13.28% (n = 151) were physicians, 74.76% (n = 850) were nurses, midwives, or emergency medical technicians, and 11.96% (n = 136) were anesthesia, laboratory, or radiology technicians.
The entire dataset was randomly divided into two datasets according to the total number of participants, and EFA was applied to one of these two datasets. Through EFA, the final factor structure was obtained, and confirmation was aimed at applying CFA to the other dataset. Researchers frequently use the method of dividing the data in half completely at random. It was reported that randomly splitting the dataset in half in large samples did not create a difference in the results [26]. In the data analyses in this study, EFA was performed using the IBM SPSS 22 software, while the LISREL 8.80 software was employed for CFA. Reliability was tested using the Cronbach’s alpha coefficient.
2.4.1. Exploratory factor analysis
EFA is a factor reduction method, and there are various views regarding the identification of the number of factors. Having an eigenvalue greater than 1, which is known as Kaiser’s K1 rule, is the essential method for deciding on the number of factors [27]. The Kaiser-Meyer-Olkin (KMO) test and Bartlett’s test of sphericity were performed to determine whether the data were suitable for factor analysis [28]. There are also various criteria regarding the factor load values required for items on loaded on factors. According to a previous study, for an item to be related to a factor, the minimum factor load value must be 0.30 [28]. Additionally, when an item is loaded on multiple factors, the difference between the factor loads of the same item must be at least 0.10 [28].
The maximum probability method, which is recommended to be used in large samples as a factor extraction method, and the direct Oblimin method, which is a rotation method, were used in this study. It was previously stated that there are no significant differences between factor extraction methods used in EFA in terms of the number of factors and that methods may differ only if there are low factor load values [27].
An EFA was applied to the data obtained from 578 participants. Items 2, 3, 8, 11, 13, 14, 15, 22, 24, 34, 36, and 42 were inversely scored items, and they were included in the analyses accordingly.
2.4.2. Confirmatory factor analysis
To verify the 39-item and 7-subscale construct obtained after conducting the EFA, a CFA was applied to the second dataset. CFA is a multivariate statistical method that verifies a theorical construct related to a model with a known factor structure [30]. For the CFA to be conducted in this study, new numbers were assigned to the items. The codes of the items were changed to Organizational Learning Development and Communication (OLDC) for subscale 1, Management Support and Leadership (MSL) for subscale 2, Reporting Patient Safety Events (RPSE) for subscale 3, Personnel and Working Hours (NPWH) for subscale 4, Response to Error (RE) for subscale 5, Teamwork (TW) for subscale 6, and Working Environment (WE) for subscale 7. t-values for all scale items being at the significance level of 0.05 and being outside the ±1.96 interval, which is the t-value at the infinite degrees of freedom, show that the factor load value is significant [25]. As the absolute t-values of all items on the scale were greater than 1.96, they were statistically significant, and there were no items that should have been removed from the scale.
The chi-squared value tends to be significant when large samples are used [31]. Therefore, the ratio of the chi-squared value to the degrees of freedom value is recommended for use [32]. A ratio lower than three shows an excellent fit, while a ratio between 3 and 5 shows a good fit [30]. On the other hand, in general, there is no consensus on what a good level of fit is. The aforementioned ratio could be lower than three and can reach up to 5 [33–35]. Another goodness-of-fit index that is used in this method is RMSEA (Root Mean Square Error of Approximation). An RMSEA value lower than 0.08 shows a good fit, while one lower than 0.05 indicates an excellent fit [36]. Other model fit indices that were used in this study were CFI (Comparative Fit Index), NFI (Normed Fit Index), IFI (Incremental Fit Index) [37], and SRMR (Standardized Root Mean Square Residual). A more detailed explanation of these fit indices was reported by Schumacher and Lomax [36].
3. Results
3.1. Exploratory factor analysis results
First, the Kaiser-Meyer-Olkin (KMO) (0.924) and Bartlett’s test (χ^2^ = 9748.777, df = 770, p = 0.000) results were analyzed. Accordingly, it was determined that the data structure and sample were suitable for factor analysis. According to the results obtained by administering the 39-item Patient Safety Scale for Hospitals, eigenvalues were higher than one for seven subscales. All subscales collectively explained 49.2% of the total variance in the measured variable.
The factor loads values of items were found to be between 0.325 and 0.674 for OLDC, between −0.780 and −0.402 for MSL, between 0.509 and 0.761 for RPSE, between 0.311 and 0.739 for NPWH, between 0.342 and 0.423 for RE, between −0.822 and −0.386 TW, and between 0.398 and 0.650 for WE. Therefore, factor loads in all structures were significant. Initially, the draft scale had 44 items. Items 15, 16, 29, and 34 did not yield any significant load on any factor, item 40 was removed from the scale by the researchers, and the EFA results for the 39-item scale are presented in Table 1 as items factor loadings of items and in Table 2 as subscales’ values of eigenvalue, variance explained and Cronbach’s alpha.
3.2. Confirmatory factor analysis results
The t-values of each item that were obtained as a result of the CFA are presented in Table 3. The graph of the standardized path coefficients for the Patient Safety Scale for Hospitals is shown in Figure. The path coefficients were found to be in the ranges of 0.45–0.76 for the OLDC, 0.58–0.80 for the MSL, 0.45–0.81 for the RPSE, 0.31–0.71 for the NPWH, 0.43–0.78 for the RE, 0.69–0.82 for the TW and 0.53–0.67 for the WE. These coefficients were found to be high.
The model data fit results are presented in Table 4. Among the goodness-of-fit indices that were used in the study, The RMSEA value was found as 0.060, showing a good fit. Considering the other goodness-of-fit index values, for the Patient Safety Scale for Hospitals, model data fit was achieved (RMSEA = 0.060, CFI = 0.97, NFI = 0.95, IFI = 0.97, SRMR = 0.060). The ratio of chi-squared-to-degrees of freedom ratio was found as 3.04.
3.3. Reliability
The Cronbach’s internal consistency coefficient of the total scale was found as 0.921. The results of the reliability analysis of the subscales obtained in the study are presented in Table 3. The coefficients of the seven original subscales were as follows: OLDC = 0.91; MSL = 0.90; TW = 0.85; WE = 0.71; RPSE = 0.66; NPWH = 0.66; and RE = 0.66. As subscales, OLDC had thirteen items, MSL had eight items, TW had four items, and these subscales were determined to be highly reliable. Moreover, WE had four items, RPSE had three items, NPWH had four items, and RE had three items, and these subscales were determined to be quite reliable. However, it is also known that it is possible to obtain a low Cronbach’s alpha coefficient when the number of items is low.
4. Discussion
The KMO coefficient provides information about the suitability of both the data matrix for factor analysis and the data structure for factor extraction, while Bartlett’s test can be considered evidence of both the suitability of the data matrix and the normality of the scores [28]. In this study, according to the result of the KMO test, the data were suitable for factor analysis, and based on the result of the Bartlett’s test of sphericity, the data were determined to be normally distributed for factor analysis [35, 38]. Some KMO and Bartlett’s test results reported in various cultural adaptations of scales are as follows: the Hospital Survey on Patient Safety Culture in Palestine (HSOPSC-AR) KMO = 0.85, Bartlett’s test, p < 0.001 [39]; the Hospital Survey on Patient Safety Culture in Türkiye (HSOPS-TR) KMO = 0.9, Bartlett’s test, p < 0.001 [21]; the Hospital Survey on Patient Safety Culture in China (HSPSC-CN) Version KMO > 0.7, Bartlett’s test, p < 0.05 [40]; the Hospital Survey on Patient Safety Culture in Croatia (HSOPSC-HRK) KMO = 0.887, Bartlett’s test, p < 0.001 [41], and the Hospital Survey on Patient Safety Culture in Dutch (HSOPS-NL) KMO = 0.9, Bartlett’s test, p < 0.001 [42]. The KMO and Bartlett’s test results of other studies in the literature showed compatibility with the results of this study.
The rate of the total variance in the measured variable explained by the factors of a scale, which is expected to be above 0.30 in multifactor constructs, indicates how well the related construct is measured, and this rate is expected to be high. The rates of the total variance explained by factors in various studies are as follows: HSOPSC-AR 61.44% [39]; HSOPS-TR 62.1% [21]; HSPSC-CN 60% [40]; HSOPSC-HRK 59% [41], and HSOPS-NL 57.1% [42]. These results were similar to the results obtained in this study. A high variance explanation rate shows how well the relevant construct is measured [28]. Hence, it can be stated that the total variance explained by the factors of the scale that was developed in this study was acceptable for the social sciences.
The factor load value is a coefficient explaining the relationship between items and factors, and it is expected to be high [28]. The existence of a cluster of items with high levels of correlation with a factor means that those items together measure a factor [28]. Some factor load value ranges in various cultural adaptations of scales were reported as follows: the Safety Attitudes Questionnaire (SAQ) between 0.40 and 0.81 [43]; the Safety Attitudes Questionnaire in Turkish (SAQ-TR) between 0.29 and 0.78 [20]; the Safety Attitudes Questionnaire in Danish (SAQ-DK) between 0.28 and 0.88 [44]; the Safety Attitudes Questionnaire German language version in Swiss (SAQ-CH) between 0.345 and 0.862 [45]; HSOPSC-AR between 0.43 and 0.88 [39]; HSOPS-TR between 0.36 and 0.87 [21]; HSPSC-CN between 0.40 and 0.70 [40]; HSOPSC-HRK between 0.365 and 0.908 [41]; HSOPS-NL between 0.36 and 0.88 [42], and the Hospital Survey on Patient Safety Culture (HSPSC-MX) between 0.324 and 0.970 [46]. All factor load values that were determined in this study were >0.30, and they were similar to the factor load values in previous scale development studies in the relevant literature.
The goodness-of-fit value refers to how well the parameter estimates of the results of CFA (i.e. factor loads, factor correlations, error covariances) reproduce the relationships observed in the sample’s data [37]. Among the model fit indices of this study, the chi-squared test statistic (χ^2^), χ^2^/df, comparative fit index (CFI), normed fit index (NFI), and incremental fit index (IFI) showed an excellent fit, while the root mean square error of approximation (RMSEA) and standardized root mean square residual (SRMR) values showed a good fit. In general, the goodness-of-fit indices of the scale in this study showed a good fit. In previous studies in which scales have been tested, the goodness-of-fit indices of SAQ [43], SAQ-TR [20], the Safety Attitudes Questionnaire in Chinese (SAQ-CN) [47], and the Safety Attitudes Questionnaire in Norwegian (SAQ-NO) [48] have shown satisfactory fit results. HSOPSC-AR [39], HSPSC-MX [46], the Safety Attitudes Questionnaire in Swedish (SAQ-SE) [49], SAQ-DK [44], and SAQ-CH [45] showed good fit results. Finally, goodness-of-fit values of the Patient Safety Climate in Healthcare Organizations (PSCHO) scale was found to be good [50].
The Cronbach’s alpha coefficient is used for the assessment of internal consistency. In the relevant literature, it was stated that Cronbach’s alpha coefficients in the range of 0.00–0.39 indicate that the examined scale is unreliable, those in the range of 0.40–0.59 indicate low reliability, those in the range of 0.60–0.79 indicate moderate reliability, and those in the range of 0.80–1.00 indicate high reliability [38]. In general, when this coefficient is 0.70 or higher, the results are considered reliable. As subscales, OLDC, MSL, and TW were determined as highly reliable, and WE, RPSE, NPWH, and RE were determined as quite reliable. However, it is also known that it is possible to obtain low Cronbach’s alpha coefficients when the number of items is low [51]. In previous studies, Cronbach’s alpha values of scales about a patient safety culture have been reported as HSOPSC-AR 0.87 [39]; HSOPS-TR 0.88 [21]; HSPSC-CN 0.84 [40]; HSOPSC-HRK 0.88 [41]; HSPSC-MX 0.71 [46]; SAQ-CN 0.945 [47]; SAQ-TR 0.89 [20]; and SAQ-DK 0.89 [44].
The aim of this study was to develop the Patient Safety Culture Scale for Hospitals (PSCSH) and test its validity and reliability. As a result of the psychometric properties of PSCSH, the scale was determined to be a valid and reliable measurement instrument for healthcare professionals. It is believed that the scale will be useful in determining the current patient safety culture at an institution and following up on its development, especially in healthcare institutions that will start initial patient safety practices.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Slawomirski L Klazinga N The Economics of Patient Safety From Analysis to Action OECD Health Working Papers No:145 OECD Publishing 2022
- 2Panagioti M Khan K Keers RN Abuzour A Phipps D Prevalence, severity, and nature of preventable patient harm across medical care settings: systematic review and meta-analysis The BMJ 2019 366 l 4185 1 27 10.1136/bmj.l 4185 PMC 693964831315828 · doi ↗ · pubmed ↗
- 3de Vries EN Ramrattan MA Smorenburg SM Gouma DJ Boermeester MA The incidence and nature of in-hospital adverse events: a systematic review Quality and Safety in Health Care 2008 17 3 216 223 10.1136/qshc.2007.023622 18519629 PMC 2569153 · doi ↗ · pubmed ↗
- 4Committee on Improving the Quality of Health Care Globally, Board on Global Health, Board on Health Care Services, Health and Medicine Division Crossing the Global Quality Chasm: Improving Health Care Worldwide Washington (DC) National Academies Press (US) 2018 30605296 · pubmed ↗
- 5Elmontsri M Banarsee R Majeed A Improving patient safety in developing countries - moving towards an integrated approach Journal of the Royal Society of Medicine Open 2018 9 11 1 5 10.1177/2054270418786112 PMC 623686830455962 · doi ↗ · pubmed ↗
- 6Institute of Medicine Committee on Quality of Health Care in America To Err is Human: Building a Safer Health System Washington (DC) National Academies Press (US) 2000 25077248 · pubmed ↗
- 7World Health Organization Conceptual Framework for The International Classification for Patient Safety Version 1.1 Geneva World Health Organization 2010
- 8World Health Organization Global Patient Safety Action Plan 2021–2030 Towards Eliminating Avoidable Harm in Health Care Geneva World Health Organization 2021
