Development and 2025 re-evaluation of a Japanese quality indicator set for adult intensive care: a modified RAND/UCLA Delphi study
Junji Kumasawa, Hiroshi Okamoto, Takaki Naito, Takehiko Asaga, Masayoshi Kimura, Toru Kotani, Yasuhiro Komatsu, Takeshi Suzuki, Tetsuya Takahashi, Tetsuhiro Takei, Masaji Nishimura, Satoru Hashimoto, Hiroko Yamaguchi, Matsuyuki Doi

TL;DR
This study developed and updated a set of quality indicators for adult intensive care units in Japan to improve care standards and enable national benchmarking.
Contribution
The paper presents a re-evaluated and updated Japanese ICU quality indicator set using a modified RAND/UCLA Delphi method in 2025.
Findings
A 2018 consensus set of 38 ICU quality indicators was established through expert ratings and meetings.
In 2025, 37 of the 2018 indicators were retained as Appropriate, with one removed and one new indicator added.
The updated 2025 core set provides a foundation for national benchmarking and quality improvement in Japanese ICUs.
Abstract
Quality indicators (QIs) support measurement and improvement of ICU care; however, until 2017 there was no comprehensive, evidence-based QI set tailored to frontline adult ICU practice in Japan. Because evidence and practice evolve, periodic updating is required. We developed a Japanese ICU QI set in 2017–2018 and conducted a formal re-evaluation in 2025 to confirm current validity and update the core set. We used a modified RAND/UCLA appropriateness method. First, an initial list of candidate QIs was generated through a systematic literature review and major clinical guidelines. Next, a multidisciplinary panel of 14 Japanese experts (including intensivists, a nurse, a physiotherapist, and a clinical engineer) rated the appropriateness of these indicators using a nine-point Likert scale (Round 1). Following Round 1, a face-to-face consensus meeting was held to discuss, refine, add, or…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSepsis Diagnosis and Treatment · Intensive Care Unit Cognitive Disorders · Delphi Technique in Research
Introduction
Intensive care units (ICUs) provide life-sustaining care for the most critically ill patients, but this care is complex, resource-intensive, and carries inherent risks. Ensuring the delivery of consistent, high-quality care is, therefore, a global priority to optimize patient outcomes and improve healthcare efficiency. Quality indicators (QIs) have emerged as a standard, evidence-based tool for measuring, monitoring, and improving the quality of healthcare across various disciplines. Rooted in Donabedian’s classic framework of structure, process, and outcome [1], QIs translate clinical evidence into measurable metrics that can reveal gaps between recommended and actual practice, thereby identifying opportunities for improvement.
In the field of critical care, several international bodies, such as the European Society of Intensive Care Medicine (ESICM), have developed foundational QI sets that have advanced the science of quality measurement [2]. These international efforts provide an invaluable framework and a common language for discussing ICU quality. However, the direct application of these indicator sets across different national healthcare systems can be challenging. Significant variations in clinical practice patterns, patient demographics, data infrastructure, and resource availability can limit the relevance and feasibility of indicators developed in one context when applied to another.
This issue has been particularly pertinent in Japan. Although Japan has a sophisticated healthcare system and a national ICU patient registry (the Japanese Intensive care PAtient Database, JIPAD) [3], until we initiated this study in 2017, there had been no comprehensive, evidence-based set of QIs developed specifically to assess the quality of frontline clinical care in Japanese ICUs. This absence of a standardized, contextually validated toolkit was a significant barrier to nationwide quality benchmarking, identifying areas for improvement, and systematically evaluating the impact of quality improvement initiatives. A dedicated QI set tailored to the Japanese healthcare environment was, therefore, urgently needed. To address this gap, we developed a Japanese ICU QI set through a formal multidisciplinary consensus process conducted in 2017–2018 using a modified RAND/UCLA appropriateness method.
However, quality indicator sets are not static. Because critical care evidence and clinical practice evolve rapidly, indicator sets require periodic re-evaluation and updating to maintain current validity and feasibility. To ensure that this QI set remains relevant to contemporary practice, we, therefore, performed an additional re-evaluation in 2025 using the same RAND/UCLA-based rating approach.
Therefore, this study reports [1] the development of a comprehensive and evidence-based set of quality indicators for adult ICU care tailored to the Japanese healthcare system using a modified RAND/UCLA appropriateness method and a formal multidisciplinary consensus process, and [2] a 2025 re-evaluation round to update and assess the current validity of the resulting indicator set.
Methods
Study design
This is a Delphi study to develop and update a set of QIs for the care of patients in ICUs. We used the modified RAND/UCLA appropriateness method to develop a set of QIs by combining expert opinion and scientific evidence [4]. This consensus method has been widely used to develop QIs [5–7]. The initial development phase (2017–2018) comprised four steps. First, we identified a list of candidate indicators through a systematic literature review and relevant clinical guidelines. Second, we carried out a first round of questionnaires, asking members of the expert panel to assess the appropriateness of each potential indicator. Third, an in-person expert panel meeting was held, during which the panelists reviewed and discussed the appropriateness of the indicators based on the results from the first questionnaire round. Fourth, we conducted a second round of questionnaires to finalize the 2018 consensus set of quality indicators. To address the substantial time gap since the original consensus process (2017–2018), we additionally conducted a re-evaluation round in 2025 (Round 3), in which the expert panel re-rated the final 2018 QI set using the same RAND/UCLA rating approach, with an option to rate newly proposed indicators reflecting contemporary practice. This study did not involve analysis of patient-level data from electronic health records; all evaluations were based solely on the expert panel’s ratings.
This study protocol was reviewed and approved by the Ethics Committee of Sakai City Medical Center (Approval No. S-17-061; date: 10 Dec 2017).
Quality indicators
QIs are crucial instruments for evaluating healthcare quality, providing a scientific basis for evaluating performance. The calculation of QIs typically relies on two primary components:
Denominator: The defined patient population or condition to which the QI applies.
Numerator: The criteria outlining appropriate treatment or desired outcomes.
QI can be utilized to determine either:
The proportion of patients receiving appropriate treatment within a given facility.
The number of indicators achieved at the individual patient level.
Rooted in Donabedian’s framework, which encompasses three dimensions: structure, process, and outcome, process indicators, which describe the delivery or reception of care, are particularly valuable as they represent modifiable factors directly influencing quality improvement [1].
Step 1: Systematic review and selection of indicator candidates
We systematically searched MEDLINE from Jan 1946 to 6th Nov 2017. The full search strategies are described in the Supplementary Methods 1. Then, we extracted candidate indicators from the full-text review. Three intensive care specialists (JK, HO and TN) assessed and extracted data independently.
Step 2: First questionnaire round
The expert panel for this study comprised fourteen professionals from diverse medical and allied health disciplines, including ten intensivists, one nephrologist, one physiotherapist, one clinical engineer, and one nurse, selected from different geographical regions and various types of institutions, including both academic and community hospitals, to ensure a broad representation of clinical practice in Japan. We distributed a questionnaire via email, requesting each expert to evaluate the appropriateness of potential quality indicators using a nine-point Likert scale (1 = extremely inappropriate; 9 = extremely appropriate). The nine-point scores were divided into three ranges: 1–3 (inappropriate), 4–6 (uncertain), and 7–9 (appropriate). Each indicator was then classified as “Inappropriate”, “Uncertain,” or “Appropriate” based on the median score and the level of panel agreement/disagreement.
- An indicator was classified as Inappropriate if its median score was in the 1–3 range, without disagreement.
- An indicator was classified as Uncertain if its median score was in the 4–6 range, or if the panel’s ratings showed disagreement, regardless of the median.
- An indicator was classified as Appropriate if its median score was in the 7–9 range, without disagreement.
For the primary analysis, disagreement was defined as the situation in which at least five panelists rated an indicator in the low-appropriateness range [1–3] and at least five panelists rated it in the high-appropriateness range [7–9]. As a sensitivity analysis, we also assessed disagreement using the interpercentile range adjusted for symmetry (IPRAS) method described in the RAND/UCLA manual [8] and confirmed that the overall classification did not change when using the IPRAS-based definition (see Supplementary Methods 2).
Indicators classified as ‘Appropriate’ or ‘Uncertain’ were carried forward for discussion at the subsequent expert panel meeting.
Step 3: Expert panel meeting
All panel members participated in a face-to-face meeting to review the preliminary results from the initial questionnaire round. The primary purpose of this meeting was to facilitate open discussions on each potential indicator and to reach a consensus. During the meeting, the experts qualitatively evaluated the indicators based on three key criteria: (1) broad applicability across diverse intensive care settings, (2) measurability using data sources commonly available in Japan (e.g., electronic health records, administrative claims, registry data, and, where applicable, facility-level surveys), and (3) relevance to quality improvement, with particular emphasis on actionable (modifiable) process indicators. Indicators failing to meet these criteria were either revised or excluded. In addition, experts were encouraged to propose new potential indicators if deemed necessary. These newly suggested indicators were qualitatively assessed in the meeting and subsequently included in the second quantitative assessment.
In this study, “facility-level surveys” refer to institutional surveys (e.g., staffing, protocols, and safety systems) rather than patient-reported surveys. For hand hygiene, the survey is intended to capture facility-reported compliance results from local audit/monitoring programs based on the WHO direct observation framework, in which compliance is defined as the proportion of fulfilled hand hygiene opportunities among observed opportunities.
Step 4: Second questionnaire round and final selection (round 2; 2018 consensus set)
Following the expert panel meeting, a revised list of accepted, modified, and newly proposed indicators was compiled into a second questionnaire. This questionnaire was then sent to panel members via email for final evaluation. In this second round, indicators were re-rated using the same nine-point Likert scale as the first round. Indicators achieving “Appropriate” defined above constituted the final 2018 consensus set of quality indicators for care of patients in ICUs [5, 8, 9].
Step 5: Re-evaluation round (round 3; 2025 update)
In 2025, the expert panel re-rated all indicators from the 2018 consensus set to re-assess their current validity. The Round 3 questionnaire was distributed by email to all 14 members of the original panel, and all 14 panelists returned complete ratings (response rate: 14/14, 100%). The re-evaluation was conducted as an email-based survey without an additional face-to-face or virtual consensus meeting between ratings. Individual ratings were not shared with other panelists during Round3. To support a time-updated appraisal, panelists were provided with the finalized 2018 indicator definitions and the key background materials used in the original process, together with selected updates reflecting contemporary practice where relevant. A free-text comment field was included for each indicator, and panelists were invited to propose additional candidate indicators reflecting evolving standards; newly proposed candidates were compiled and circulated for rating within the same RAND/UCLA-based framework. The same nine-point scale and classification rules were applied. Indicators rated as “Appropriate” in Round 3 were retained in the updated set, whereas indicators rated as “Uncertain” or “Inappropriate” were not retained as core QIs but were reported transparently.
Statistical analysis
This study did not involve analysis of patient-level data but relied solely on scores rated by the expert panel. We summarized ratings using medians and RAND/UCLA-based agreement/disagreement rules. Disagreement was assessed primarily using the predefined count-based definition and, as a sensitivity analysis, using the IPRAS method. Details of the IPRAS calculation and the comparison between definitions are provided in Supplementary Methods 2.
Results
We identified 670 articles in the initial search, and after screening abstracts, selected 69 studies for full text review. We included 14 articles as sources of QIs [2, 8, 10–21]. The study flow diagram is shown in Fig. 1. We also reviewed the evidence-based guidelines published by the Japanese Society of Intensive Care Medicine [22, 23] and leading international guidelines [24–28]. After a screening process, 44 potential QIs were extracted for evaluation. Table 1 shows the details of extracted potential QIs.Fig. 1PRISMA flow diagram of the study selection process. The diagram details the process of identifying, screening, and selecting relevant studies for inclusion in the review of quality indicator development in the intensive care unit. The initial search identified 670 records. After screening and a full-text eligibility assessment, 14 studies were included in the final reviewTable 1Description of the potential quality indicators prepared for the first-round evaluationNoQuality indicatorDenominator to calculate the QI (criteria for applicable patients)Numerator to calculate the QI (criteria for achievement)Type of QI1ICU mortalityPatients discharged from the ICUPatients who die during the ICU stayOutcome2Hospital mortalityPatients who are admitted to the ICU during their hospitalization and are subsequently discharged from the hospitalPatients who die during the hospital stayOutcome330-day mortalityPatients discharged from the ICUPatients who died within 30 days of ICU admissionOutcome4Standardized mortality rateMean predicted mortality rateObserved mortality rateOutcome5Readmission to the ICUPatients discharged alive from the ICUPatients with an unplanned ICU readmission within 72 h of dischargeOutcome6Nighttime ICU dischargePatients discharged alive from the ICUPatients discharged from the ICU between 22:00 and 06:59Outcome7ICU bed occupancy rateNumber of ICU beds × 365 daysNumber of patients admitted to the ICU in 1 yearStructure8ICU stay lasting 7 days or morePatients discharged from the ICUPatients who stayed in the ICU for 7 days or moreOutcome9Length of stay in ICUPatients discharged from the ICUSum of the length of stay for all patients discharged from the ICUOutcome10Full ICU occupation rateTotal number of days in a year (365)Number of days in a year the ICU was fully occupiedOutcome11Targeted temperature management after cardiac arrestPatients discharged from the ICU after cardiopulmonary resuscitationPatients who received therapeutic hypothermiaOutcome12Monitoring sedationN/AThe facility has an established policy and procedure for sedation monitoringStructure13Monitoring painN/AThe facility has an established policy and procedure for pain monitoringStructure14Identification of deliriumN/AThe facility has an established policy and procedure for delirium monitoringStructure15ReintubationMechanically ventilated patients discharged from the ICU (excluding those with a tracheostomy)Patients who were reintubated within 48 h of extubationOutcome16Duration of mechanical ventilationMechanically ventilated patients discharged from the ICUSum of the duration of mechanical ventilation for all ventilated patientsOutcome17Prolonged mechanical ventilationMechanically ventilated patients discharged from the ICUPatients who received mechanical ventilation for 10 days or moreOutcome18Postoperative respiratory failurePostoperative patients admitted to and discharged from the ICUPatients who received mechanical ventilation for 10 days or more after surgeryOutcome19Unplanned extubationTotal number of mechanical ventilation days for all ventilated patientsPatients who had an unplanned extubationOutcome20Use of a weaning protocolN/AThe facility has an established policy and procedure for weaning from mechanical ventilationStructure21Ventilator-associated pneumonia (VAP) prevention bundleMechanically ventilated patients discharged from the ICUPatients for whom the VAP prevention bundle was adhered toProcess22Low tidal volume ventilation in ARDSPatients with ARDS who received mechanical ventilation in the ICU and were discharged from the ICUPatients managed with a tidal volume of 6–8 mL/kg (predicted body weight) and an inspiratory plateau pressure < 30 cmH2OProcess23Early enteral nutritionPatients who stayed in the ICU for 48 h or more and were subsequently dischargedPatients who received enteral nutrition within 48 h of ICU admissionProcess24Stress ulcer prophylaxisPatients discharged from the ICUPatients who received a proton pump inhibitor (PPI) or H2 blockerProcess25Stress ulcer prophylaxis during mechanical ventilationMechanically ventilated patients discharged from the ICUPatients who received a proton pump inhibitor (PPI) or H2 blockerProcess26Glycemic controlTotal number of blood glucose measurements for all patients during their ICU stayNumber of measurements, where blood glucose was ≤ 80 mg/dL or ≥ 200 mg/dLProcess27Blood culture samplingNon-postoperative patients who received antibiotics and were discharged from the ICUPatients who had blood cultures drawn before the initiation of antibioticsProcess28Prevalence of MRSAPatients discharged from the ICUPatients with MRSA detected from a specimen collected within 24 h of ICU admissionOutcome29New-onset MRSAPatients discharged from the ICUPatients with new-onset MRSA detected more than 24 h after ICU admissionOutcome30New-onset multidrug-resistant organismsPatients discharged from the ICUPatients with a new multidrug-resistant organism (resistant to ≥ 2 agents) detected more than 24 h after ICU admissionOutcome31Restrictive transfusion strategy in sepsisPatients with sepsis discharged from the ICU (excluding those with myocardial infarction, severe hypoxemia, or acute hemorrhage)Patients who received RBC transfusion only when their hemoglobin level was below 7 g/dLProcess32Red blood cell (RBC) transfusion administrationPatients discharged from the ICUPatients who received an RBC transfusionProcess33Amount of RBC transfusionPatients who received an RBC transfusion and were discharged from the ICUTotal units of RBCs transfusedProcess34Plasma transfusion administrationPatients discharged from the ICUPatients who received a plasma transfusionProcess35Amount of plasma transfusionPatients who received a plasma transfusion and were discharged from the ICUTotal units of plasma transfusedProcess36Venous thromboembolism (VTE) prophylaxisPatients discharged from the ICUPatients who received anticoagulation therapy for VTE prophylaxisProcess37Hand hygieneNumber of observed hand hygiene opportunities during audit observations (e.g., WHO 5 Moments for Hand Hygiene)Number of observed opportunities in which hand hygiene was performed appropriatelyProcess38Early rehabilitationPatients discharged from the ICUPatients who started rehabilitation within 48 h of ICU admissionProcess39Dedicated intensivist hoursN/AThe number of hours an intensivist is dedicated to the ICUStructure40Patient-to-nurse ratioN/AThe number of patients assigned to one nurseStructure41Multidisciplinary rounds and daily goal settingN/AMultidisciplinary rounds and daily goal setting are implementedStructure42Standardized handoff communicationN/AA standardized handoff method is in placeStructure43Adverse event reporting systemN/AA system for reporting adverse events is in placeStructure44Incident reporting by physiciansTotal number of incident reports for ICU patientsNumber of incident reports submitted by physicians for ICU patientsStructureN/A: Not applicable; ICU: Intensive Care Unit; QI: Quality Indicator; VAP: Ventilator-associated pneumonia; ARDS: Acute Respiratory Distress Syndrome; MRSA: Methicillin-Resistant Staphylococcus Aureus; RBC: Red Blood Cell; VTE: Venous Thromboembolism; WHO: World Health Organization
First questionnaire round
In the first evaluation round, conducted between 1st May and 31st May 2018, the fourteen expert panel members rated the 44 potential QIs. Of the 44 initial candidates, the 34 indicators had a median appropriateness score of 7 or higher. Among these 34 indicators, 33 showed agreement and were classified as “Appropriate”, while the one remaining indicator had disagreement and, therefore, classified as “Uncertain”. Ten indicators had median scores between 5 and 6.5, and were all classified as “Uncertain”. No indicators were classified as “Inappropriate”. The 11 indicators rated as “Uncertain” were prioritized for discussion at the face-to-face panel meeting (Table 2).Table 2. Appropriateness ratings and final decisions for each quality indicatorNoQuality indicatorRound 1Meeting decisionRound 2Final decisionMedian ratingAgreement/disagreementMedian ratingAgreement/disagreement1ICU mortality9AgreementAppropriate9AgreementSelected2Hospital mortality7.5AgreementAppropriate9AgreementSelected330-day mortality7AgreementAppropriate7.5AgreementSelected4Standardized mortality rate8AgreementAppropriate9AgreementSelected5Readmission to the ICU8AgreementAppropriate8AgreementSelected6Nighttime ICU discharge6AgreementDeleted–––7ICU bed occupancy rate6AgreementUncertain7AgreementSelected8ICU stay lasting 7 days or more7AgreementAppropriate7AgreementSelected9Length of stay in ICU7AgreementAppropriate7AgreementSelected10Full ICU occupation rate5AgreementDeleted–––11Targeted temperature management after cardiac arrest7DisagreementUncertain6AgreementDeleted12Monitoring sedation9AgreementAppropriate9AgreementSelected13Monitoring pain9AgreementAppropriate9AgreementSelected14Identification of delirium9AgreementAppropriate9AgreementSelected15Reintubation7.5AgreementAppropriate7AgreementSelected16Duration of mechanical ventilation8AgreementAppropriate8AgreementSelected17Prolonged mechanical ventilation6.5AgreementUncertain7.5AgreementSelected18Postoperative respiratory failure7AgreementModified7.5AgreementSelected19Unplanned extubation8AgreementAppropriate7AgreementSelected20Use of a weaning protocol7.5AgreementAppropriate8AgreementSelected21Ventilator-associated pneumonia (VAP) prevention bundle8AgreementAppropriate8AgreementSelected22Low tidal volume ventilation in ARDS8AgreementAppropriate7.5AgreementSelected23Early enteral nutrition7AgreementAppropriate7.5AgreementSelected24Stress ulcer prophylaxis6.5AgreementUncertain6AgreementDeleted25Stress ulcer prophylaxis during mechanical ventilation8AgreementModified7AgreementSelected26Glycemic control7AgreementAppropriate7AgreementSelected27Blood culture sampling9AgreementAppropriate8AgreementSelected28Prevalence of MRSA6AgreementDeleted–––29New-onset MRSA8AgreementAppropriate7AgreementSelected30New-onset multidrug-resistant organisms7AgreementAppropriate7AgreementSelected31Restrictive transfusion strategy in sepsis7AgreementAppropriate6.5AgreementSelected32Red blood cell (RBC) transfusion administration6AgreementDeleted–––33Amount of RBC transfusion5AgreementDeleted–––34Plasma transfusion administration5AgreementDeleted–––35Amount of plasma transfusion5AgreementDeleted–––36Venous thromboembolism (VTE) prophylaxis7AgreementModified8AgreementSelected37Hand hygiene8.5AgreementAppropriate8AgreementSelected38Early rehabilitation8AgreementAppropriate8AgreementSelected39Dedicated intensivist hours8AgreementAppropriate8AgreementSelected40Patient-to-nurse ratio8AgreementAppropriate8AgreementSelected41Multidisciplinary rounds and daily goal setting8AgreementAppropriate8AgreementSelected42Standardized handoff communication7AgreementAppropriate7.5AgreementSelected43Adverse event reporting system7AgreementAppropriate7.5AgreementSelected44Incident reporting by physicians7AgreementAppropriate7.5AgreementSelected45Ventilator-associated pneumonia (VAP)––Added8AgreementSelected46Number of clinical engineers––Added7AgreementSelected47Number of pharmacists––Added7AgreementSelectedICU: Intensive Care Unit; VAP: Ventilator-associated pneumonia; ARDS: Acute Respiratory Distress Syndrome; MRSA: Methicillin-Resistant Staphylococcus Aureus; RBC: Red Blood Cell; VTE: Venous Thromboembolism
Expert panel meeting
The face-to-face panel meeting was held on 13th June 2018. During the meeting, the results of the first questionnaire round were presented, and all QIs were discussed based on the predetermined criteria, such as clinical importance, scientific evidence, and feasibility of measurement in the ICU setting. As a result of the discussion, 3 of the 44 potential indicators were modified, 7 were deleted, and 3 new indicators were added by the panel members. Reasons for modification or deletion are summarized in Table 3 [29, 30]. The primary reasons for deletion included a lack of clear evidence, difficulty in data collection from electronic health records, and overlap with other indicators.Table 3. Reasons for modifications and deletions of quality indicators after the expert panel meetingNoQuality indicatorStatusReasons for modification or deletion1Nighttime ICU dischargeDeletedThe panel concluded that nighttime discharge does not necessarily reflect poor quality of care and can be clinically appropriate in certain situations2Full ICU occupation rateDeletedThis indicator was considered to be more related to hospital management and resource allocation than to the direct quality of clinical care3Targeted temperature management after cardiac arrestDeletedDeleted, because recent clinical evidence [e.g., the TTM trial [29, 30]] suggests that targeted normothermia is an acceptable alternative to hypothermia. Therefore, the panel concluded that mandating hypothermia alone as a quality indicator is not aligned with current evidence-based practice4Postoperative respiratory failureModifiedThe definition was modified to improve specificity and fairness. The denominator was narrowed to include only patients after elective surgery. In addition, the time threshold for defining respiratory failure was changed from 10 days to 96 h, as the panel concluded the 10-day period lacked a strong evidence base5Stress ulcer prophylaxisDeletedDeleted, because the panel concluded there is a lack of evidence for routine prophylaxis in all ICU patients. It was agreed that a more appropriate indicator would target high-risk patients for whom prophylaxis is recommended, such as those on mechanical ventilation6Stress ulcer prophylaxis during mechanical ventilationModifiedThe definition was refined as the denominator was narrowed to target a more specific high-risk population by including only patients on mechanical ventilation for 48 h or more7Prevalence of MRSADeletedThis indicator was considered to reflect community-acquired infections or issues at a previous hospital rather than the quality of care within the ICU itself8Red blood cell (RBC) transfusion administrationDeletedDeleted, because the panel concluded that the act of administering an RBC transfusion itself does not reflect the quality of care, and there is no evidence to support its use as a quality indicator9Amount of RBC transfusionDeletedThe panel decided that the more critical process indicator is the transfusion strategy, not the total volume of blood transfused10Plasma transfusion administrationDeletedDeleted, because the panel concluded that the act of administering a plasma transfusion itself does not reflect the quality of care, and there is no evidence to support its use as a quality indicator11Amount of plasma transfusionDeletedSimilar to RBCs, the appropriateness of the transfusion decision was deemed more important than the total volume, and evidence for a specific volume target is lacking12Venous thromboembolism (VTE) prophylaxisModifiedThe indicator was broadened to include mechanical prophylaxis (e.g., intermittent pneumatic compression and elastic stockings) in addition to pharmacological therapy13Ventilator-associated pneumonia (VAP) per 1,000 ventilator-daysAddedAdded, because VAP is a clinically important ICU-acquired complication and a widely used outcome metric for quality benchmarking; it complements the existing process indicator (VAP prevention bundle) and can be operationalized using an established surveillance definition (e.g., JANIS)14Number of clinical engineersAddedAdded to reflect Japan-specific ICU staffing and safety systems, where clinical engineers play a key role in supporting safe operation and management of life-support devices; this structural element is measurable via an institutional (facility-level) survey15Number of pharmacistsAddedAdded, because ICU-dedicated pharmacists contribute to medication safety and quality improvement activities (e.g., review/optimization of pharmacotherapy); this structural element is measurable via an institutional (facility-level) surveyThe modification/deletion decisions summarized in Table 3 were made during the 2018 face-to-face panel meeting based on the evidence available at that time. Where later publications are cited in the rationale [e.g., the TTM2 trial published in 2021 [30]], these references were provided to contextualize how the evidence base subsequently evolved and/or to reflect information considered during the 2025 re-evaluation; such evidence was not available to the panel at the time of the 2018 meetingICU: Intensive Care Unit; MRSA: Methicillin-Resistant Staphylococcus Aureus; RBC: Red Blood Cell; VAP: Ventilator-associated pneumonia
Second questionnaire round and finalization of the 2018 consensus set (round 2)
The revised list of 40 QIs (including modified, retained, and newly added indicators) was sent to the panel members for the second and final rating round. In this round, 37 indicators had a median score of 7 or higher, with the remaining three scoring between 6 and 6.5. All indicators reached panel agreement. Consequently, 37 indicators were deemed “Appropriate” and three “Uncertain”. After the final evaluation, two indicators were deleted (Table 2). As a sensitivity analysis, disagreement was also assessed using the IPRAS method; the appropriateness classification did not change compared with the primary count-based definition (Supplementary Table S1).
Ultimately, a final set of 38 QIs for quality of care in the ICU was established. Of these, 13 are structure indicators, 10 are process indicators, and 15 are outcome indicators. Most selected QIs can be measured using data available from existing electronic health records and administrative claims data, while a subset requires additional data sources, such as facility-level surveys or direct observation (Table 4). The 2018 consensus set of selected QIs is shown in Table 4 [31, 32].Table 4. Final set of selected quality indicators for ICU careNoQuality indicatorDenominator to calculate the QI (criteria for applicable patients)Numerator to calculate the QI (criteria for achievement)Type of QIData sources1ICU mortalityPatients discharged from the ICUPatients who die during the ICU stayOutcomeJIPAD, Administrative claims data2Hospital mortalityPatients who are admitted to the ICU during their hospitalization and are subsequently discharged from the hospitalPatients who die during the hospital stayOutcomeJIPAD, Administrative claims data330-day mortalityPatients discharged from the ICUPatients who died within 30 days of ICU admissionOutcomeJIPAD, Administrative claims data4Standardized mortality rateMean predicted mortality rate calculated by severity scores, such as APACHE IIObserved mortality rateOutcomeJIPAD5Readmission to the ICUPatients discharged alive from the ICUPatients with an unplanned ICU readmission within 72 h of dischargeOutcomeJIPAD, Administrative claims data6ICU bed occupancy rateNumber of staffed ICU beds × 365 daysTotal number of ICU patient-days during the yearStructureAdministrative claims data7ICU stay lasting 7 days or morePatients discharged from the ICUPatients who stayed in the ICU for 7 days or moreOutcomeJIPAD, Administrative claims data8Length of stay in ICUPatients discharged from the ICUSum of the length of stay for all patients discharged from the ICUOutcomeJIPAD, Administrative claims data9Monitoring sedationN/AThe facility has an established policy and procedure for sedation monitoringStructureSurvey data10Monitoring painN/AThe facility has an established policy and procedure for pain monitoringStructureSurvey data11Identification of deliriumN/AThe facility has an established policy and procedure for delirium monitoringStructureSurvey data12ReintubationMechanically ventilated patients discharged from the ICU (excluding those with a tracheostomy)Patients who were reintubated within 48 h of extubationOutcomeJIPAD, Administrative claims data13Duration of mechanical ventilationMechanically ventilated patients discharged from the ICUSum of the duration of mechanical ventilation for all ventilated patientsOutcomeJIPAD, Administrative claims data14Prolonged mechanical ventilationMechanically ventilated patients discharged from the ICUPatients who received mechanical ventilation for 10 days or moreOutcomeJIPAD, Administrative claims data15Postoperative respiratory failurePostoperative patients admitted to and discharged from the ICU after elective surgeryPatients who received mechanical ventilation for 96 h or more after surgeryOutcomeJIPAD16Unplanned extubationTotal number of mechanical ventilation-days for all ventilated patientsPatients who had an unplanned extubationOutcomeElectronic health records17Use of a weaning protocolN/AThe facility has an established policy and procedure for weaning from mechanical ventilationStructureSurvey data18Ventilator-associated pneumonia (VAP) prevention bundlePatients who received mechanical ventilation and were discharged from the ICUPatients for whom the VAP prevention bundle was adhered toProcessElectronic health records19Ventilator-associated pneumonia (VAP) per 1,000 ventilator-daysTotal number of ventilator-days in the ICUNumber of VAP events (according to the JANIS definition [31, 32]* VAP rate (per 1,000 ventilator-days): (Numerator/Denominator) × 1000OutcomeElectronic health records20Low tidal volume ventilation in ARDSPatients with ARDS who received mechanical ventilation in the ICU and were discharged from the ICUPatients managed with a tidal volume of 6–8 mL/kg (predicted body weight) and an inspiratory plateau pressure < 30 cmH2OProcessElectronic health records21Early enteral nutritionPatients who stayed in the ICU for 48 h or more and were subsequently dischargedPatients who received enteral nutrition within 48 h of ICU admissionProcessAdministrative claims data22Stress ulcer prophylaxis during mechanical ventilationMechanically ventilated patients for 48 h or more who were discharged from the ICUPatients who received a proton pump inhibitor (PPI) or H2 blockerProcessAdministrative claims data23Glycemic controlTotal number of blood glucose measurements for all patients during their ICU stayNumber of measurements, where blood glucose was ≤ 80 mg/dL or ≥ 200 mg/dLProcessLaboratory data24Blood culture samplingNon-postoperative patients who received antibiotics and were discharged from the ICUPatients who had blood cultures drawn before the initiation of antibioticsProcessAdministrative claims data25New-onset MRSAPatients discharged from the ICUPatients with new-onset MRSA detected more than 24 h after ICU admissionOutcomeElectronic health records26New-onset multidrug-resistant organismsPatients discharged from the ICUPatients with a new multidrug-resistant organism (resistant to ≥ 2 agents) detected more than 24 h after ICU admissionOutcomeElectronic health records27Restrictive transfusion strategy in sepsisPatients with sepsis discharged from the ICU (excluding those with myocardial infarction, severe hypoxemia, or acute hemorrhage)Patients who received RBC transfusion only when their hemoglobin level was below 7 g/dLProcessElectronic health records28Venous thromboembolism (VTE) prophylaxisPatients discharged from the ICUPatients who received pharmacological or mechanical anticoagulation therapy for VTE prophylaxisProcessAdministrative claims data29Hand hygieneNumber of observed hand hygiene opportunities during audit observations (e.g., WHO 5 Moments for Hand Hygiene)Number of observed opportunities in which hand hygiene was performed appropriatelyProcessFacility-reported results from locally audited monitoring30Early rehabilitationPatients discharged from the ICUPatients who started rehabilitation within 48 h of ICU admissionProcessAdministrative claims data31Dedicated intensivist hoursN/AThe number of hours an intensivist is dedicated to the ICUStructureSurvey data32Patient-to-nurse ratioN/AThe number of patients assigned to one nurseStructureSurvey data33Multidisciplinary rounds and daily goal settingN/AMultidisciplinary rounds and daily goal setting are implemented at least 5 days per weekStructureSurvey data34Standardized handoff communicationN/AA standardized handoff method is in place for all ICU shift changes and inter-unit transfersStructureSurvey data35Adverse event reporting systemN/AA system for reporting adverse events is in placeStructureSurvey data36Incident reporting by physiciansTotal number of incident reports for ICU patientsNumber of incident reports submitted by physicians for ICU patientsStructureSurvey data37Number of clinical engineersN/ANumber of clinical engineers dedicated to the ICU during weekday day shiftsStructureSurvey data38Number of pharmacistsN/ANumber of pharmacists dedicated to the ICU during weekday day shiftsStructureSurvey dataIn this study, “Survey data” refer to institutional (facility-level) surveys (e.g., staffing, protocols, and safety systems) rather than patient-reported surveysICU: Intensive Care Unit, QI: Quality Indicator, JIPAD: the Japanese Intensive care PAtient Database, N/A: Not applicable, VAP: Ventilator-associated pneumonia, JANIS: Japan Nosocomial Infection Surveillance, ARDS: Acute Respiratory Distress Syndrome, MRSA: Methicillin-Resistant Staphylococcus Aureus, RBC: Red Blood Cell, VTE: Venous Thromboembolism
Re-evaluation round (round 3; 2025 update)
In 2025, all 14 original panelists participated in Round 3 and returned complete ratings (14/14, 100%). The expert panel re-rated all indicators from the 2018 consensus set using the same nine-point scale and classification rules. Of the original 38 indicators, 37 remained classified as “Appropriate”, whereas one indicator (restrictive transfusion strategy in sepsis) was reclassified as “Uncertain” (median 6) and was, therefore, not retained in the updated core set.
In addition, one newly proposed indicator reflecting updated international practice (No. 39, ventilator-associated events per 1,000 ventilator-days) was proposed for rating, rated as “Appropriate” (median 8) and was included in the updated set. No indicator met the criterion for disagreement using either the primary count-based definition or the IPRAS definition; the appropriateness classification was unchanged when using the IPRAS-based definition (Supplementary Table S2). Round 3 re-evaluation results and final status are shown in Table 5 [33, 34]. Changes in median appropriateness scores and rating distributions between Round 2 (2018) and Round 3 (2025) are summarized in Supplementary Table S4.Table 5. Updated final quality indicators status after the 2025 re-evaluation (Round 3)NoQuality indicatorType of QIRound 3Final statusMedian ratingAppropriateness classification1ICU mortalityOutcome9AppropriateRetained2Hospital mortalityOutcome9AppropriateRetained330-day mortalityOutcome8AppropriateRetained4Standardized mortality rateOutcome9AppropriateRetained5Readmission to the ICUOutcome8AppropriateRetained6ICU bed occupancy rateStructure7AppropriateRetained7ICU stay lasting 7 days or moreOutcome7AppropriateRetained8Length of stay in ICUOutcome8AppropriateRetained9Monitoring sedationStructure9AppropriateRetained10Monitoring painStructure9AppropriateRetained11Identification of deliriumStructure9AppropriateRetained12ReintubationOutcome7AppropriateRetained13Duration of mechanical ventilationOutcome8AppropriateRetained14Prolonged mechanical ventilationOutcome8AppropriateRetained15Postoperative respiratory failureOutcome7.5AppropriateRetained16Unplanned extubationOutcome7.5AppropriateRetained17Use of a weaning protocolStructure8.5AppropriateRetained18Ventilator-associated pneumonia (VAP) prevention bundleProcess8AppropriateRetained19Ventilator-associated pneumonia (VAP) per 1,000 ventilator-daysOutcome8AppropriateRetained20Low tidal volume ventilation in ARDSProcess8AppropriateRetained21Early enteral nutritionProcess8AppropriateRetained22Stress ulcer prophylaxis during mechanical ventilationProcess8AppropriateRetained23Glycemic controlProcess8AppropriateRetained24Blood culture samplingProcess8AppropriateRetained25New-onset MRSAOutcome7AppropriateRetained26New-onset multidrug-resistant organismsOutcome7.5AppropriateRetained27Restrictive transfusion strategy in sepsisProcess6UncertainNot Retained28Venous thromboembolism (VTE) prophylaxisProcess8AppropriateRetained29Hand hygieneProcess8AppropriateRetained30Early rehabilitationProcess8AppropriateRetained31Dedicated intensivist hoursStructure8AppropriateRetained32Patient-to-nurse ratioStructure8AppropriateRetained33Multidisciplinary rounds and daily goal settingStructure8AppropriateRetained34Standardized handoff communicationStructure8AppropriateRetained35Adverse event reporting systemStructure8AppropriateRetained36Incident reporting by physiciansStructure8AppropriateRetained37Number of clinical engineersStructure8AppropriateRetained38Number of pharmacistsStructure8AppropriateRetained39Ventilator-associated events (VAE) per 1,000 ventilator daysOutcome8AppropriateNew1. Operational definitions for retained indicators are provided in Table 42. No. 27 was reclassified as Uncertain in Round 3 (median 6) and was, therefore, not retained in the updated core set3. No. 39 (VAE) was newly proposed in Round 3 to reflect updated international practice, where VAE surveillance is increasingly used alongside VAP4. The operational definition for the newly proposed indicator (No. 39 VAE) is as follows [33, 34]; Denominator: Number of ventilator-days in the same location during the surveillance period. Numerator: Number of ventilator-associated events (VAEs) identified in the surveillance location during the surveillance period. VAE rate (per 1,000 ventilator-days): (Numerator / Denominator) × 1,0005. No indicators met the criterion for disagreement using either the count-based definition or the IPRAS definition (Supplementary Method 2)6. This table also reports the indicator reclassified as Uncertain for transparencyICU: Intensive Care Unit; VAP: Ventilator-associated pneumonia; ARDS: Acute Respiratory Distress Syndrome; MRSA: Methicillin-Resistant Staphylococcus Aureus; VTE: Venous Thromboembolism; VAE: Ventilator-associated events
Discussion
In this study, we developed a comprehensive set of quality indicators for adult intensive care in Japan using a rigorous, evidence-based modified RAND/UCLA appropriateness method, and further updated this set through a formal re-evaluation round conducted in 2025. The original consensus process in 2017–2018 yielded 38 indicators, which were well-balanced across Donabedian’s domains (13 structure, 10 process and 15 outcome) and were designed with a strong emphasis on measurability, prioritizing indicators derivable from existing electronic health records and administrative claims data while also incorporating key indicators that require other data sources, such as facility-level surveys or direct observation. In the 2025 re-evaluation, the expert panel confirmed the continued appropriateness of most indicators, did not retain one indicator as a core QI, and added a contemporary surveillance indicator (VAE per 1,000 ventilator-days), resulting in an updated 2025 core set of 38 indicators. To our knowledge, this is the first study to develop a comprehensive, evidence-based set of indicators for clinical quality in Japanese ICUs. Together, these findings represent a crucial step toward standardizing quality assessment and promoting continuous quality improvement in Japanese ICUs.
The only indicator not retained as a core QI in the 2025 re-evaluation was restrictive transfusion strategy in sepsis, because transfusion thresholds in sepsis are often individualized according to patient characteristics, hemodynamics, and bleeding/ischemia risk, making a single-threshold indicator prone to exceptions and potential misclassification. Nonetheless, transfusion practice remains an important domain and warrants reconsideration in future updates.
The development of QIs for the care of patients in ICUs has been a global endeavor for decades, with multiple organizations and national initiatives proposing domains and indicators for quality measurement in critical care [2, 14, 28, 35–40]. This study aligns with these efforts by applying a structured consensus methodology and addressing fundamental aspects of ICU quality, including outcomes (e.g., mortality), safety, and prevention of healthcare-associated complications. Importantly, we extended the original 2017–2018 consensus by conducting a re-evaluation round in 2025 (Round 3), which confirmed the stability of the core set while allowing targeted refinement: one indicator was not retained as a core QI, and a contemporary surveillance indicator (ventilator-associated events, VAE) was incorporated.
To facilitate interpretation by an international audience and to enable a systematic comparison with existing international frameworks, we created Supplementary Table S3 as an international mapping table. The table maps each Japanese ICU QI to major international sources and indicates whether the correspondence is an exact match, a conceptual match within the same domain, or domain-level alignment (e.g., WHO patient safety/IPC/AMR). Overall, the comparison shows substantial alignment with global priorities while also clarifying indicators that reflect the Japanese context, especially those related to staffing and safety systems.
Even when the overarching quality domains are shared internationally, the feasibility of measurement and implementation depends heavily on local data infrastructure and health-system design. In Japan, ICU quality measurement can leverage the national ICU registry (JIPAD) and administrative claims data (DPC), but the availability, structure, and linkage of these data differ from those in other countries. In addition, the practical uptake of QIs is influenced by domestic governance structures, including facility requirements and reimbursement-related documentation, which makes explicit data-source specification and feasibility assessment particularly important in Japan. Accordingly, our indicator definitions and data-source specifications emphasize pragmatic measurability in the Japanese setting, including a transparent distinction between indicators measurable from routinely available electronic data and those requiring facility-level surveys. In addition, several indicators reflect the Japanese ICU staffing and regulatory environment (e.g., specific professional roles and institutional safety structures), which may not be specified as formal indicators in international “core sets” but are relevant to quality and safety within Japan. For certain indicators (e.g., glycemic control), we chose operational definitions that can be measured consistently across institutions using routinely available data; more granular metrics such as time-in-range may be desirable but are not always feasible in real-world data sets with heterogeneous measurement frequency.
Some context-specific indicators were proposed during the face-to-face panel meeting rather than being included in the initial candidate list before the first rating round. In our process, the initial list was anchored in literature-derived indicators and guideline-based recommendations to ensure a comprehensive evidence-based baseline. The subsequent multidisciplinary discussion made local feasibility and practice structures more explicit and enabled the panel to propose additional indicators reflecting the Japanese ICU environment. Nevertheless, future updates could strengthen contextualization earlier in the process by incorporating a structured “context scan” before the first rating round (e.g., systematic mapping to domestic regulatory/reimbursement requirements and stakeholder input), so that context-specific candidate indicators are considered from the outset.
A key strength of this study is the rigorous and reproducible consensus methodology based on the modified RAND/UCLA appropriateness method, supported by a multidisciplinary panel with broad geographic and institutional representation. We also prioritized feasibility by providing clear operational definitions and specifying realistic data sources for each indicator, distinguishing routinely available electronic data from facility-level surveys and other sources. In addition, our sensitivity analysis using the IPRAS-based disagreement definition yielded the same overall classification as the predefined count-based definition, supporting the robustness of the consensus classification approach.
The value of these QIs will ultimately depend on implementation and ongoing evaluation in real-world practice. A national measurement framework leveraging JIPAD and administrative claims data could enable routine benchmarking and feedback to participating ICUs, supporting continuous quality improvement. Future work should also evaluate the construct and predictive validity of this QI set by assessing whether higher achievement is associated with improved patient outcomes and safety, and by examining measurement reliability across institutions. As an example of how QI performance can be linked to outcomes, prior literature (e.g., Wu et al.) has reported associations between quality measurement and patient outcomes in other settings [41, 42]; however, ICU-specific validation of our indicator set remains an important next step. These efforts are essential to translate a consensus-based indicator set into measurable improvements in ICU care. Some indicators require careful interpretation as quality signals. For example, reintubation rates should be interpreted alongside balancing measures (e.g., ventilator-free days or sedation/weaning practices), because an extremely low reintubation rate is not necessarily optimal if it reflects delayed extubation or overly conservative practice.
Several limitations warrant consideration. First, although the original consensus development was conducted in 2017–2018, we addressed the substantial time gap by performing a formal re-evaluation in 2025 using the same rating and classification framework. This approach confirmed the robustness of the core set while allowing targeted updating; however, continued periodic review will remain necessary as evidence and practice evolve.
Second, the panel consisted of Japanese healthcare professionals, which was appropriate for contextual tailoring but may limit generalizability to other health systems. Third, our process did not include patients or family representatives. Patient- and family-centered outcomes, including patient-reported outcome/experience measures and longer term recovery after critical illness (e.g., post-intensive care syndrome-related domains), are increasingly emphasized internationally [43, 44]. These emerging domains were not explicitly included in the current core set and should be prioritized in future updates, potentially through inclusion of patient representatives and complementary qualitative work. Fourth, for indicators relying on facility-reported audit results (e.g., hand hygiene), audit implementation details (sampling, frequency, and assessor training) were not standardized in the original consensus process and may vary across institutions; thus, measurement and reporting bias should be considered when using these metrics for benchmarking.
Despite these limitations, this study provides a feasible and contextually adapted ICU QI set for Japan and an essential foundation for national benchmarking and quality improvement. The updated core set, together with clear operational definitions and specified data sources, can support future implementation using routinely available data and facilitate iterative refinement as evidence and practice evolve.
Conclusion
In conclusion, we developed a feasible, contextually adapted ICU QI set for Japan using a rigorous RAND/UCLA-based consensus process and confirmed its current validity through a 2025 re-evaluation round. By providing an updated core set (Table 5) and an explicit international crosswalk (Supplementary Table S3), this work offers a practical foundation for national benchmarking and continuous quality improvement while enabling transparent comparison with international standards. This framework also establishes a platform for iterative refinement as clinical evidence, data infrastructure, and societal expectations continue to evolve.
Supplementary Information
Supplementary Material 1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Committee IFA. Ventilator-associated pneumonia prevention bundle. 2010. https://www.jsicm.org/pdf/2010 VAP.pdf.
- 2WHO Guidelines on hand hygiene in health care: first global patient safety challenge clean care is safer care. 2009.23805438 · pubmed ↗
- 3Ministry of Health La W. Japan nosocomial infections surveillance. https://janis.mhlw.go.jp/section/icu.html. Accessed 23 Sept 2025.
- 4(SICSQIG) SICSQIG. Quality indicators for critical care in Scotland. 2012. https://scottishintensivecare.org.uk/uploads/2014-01-12-22-37-12-Quality Indicators 2012 pdf-62756.pdf.
- 5ACHS. Intensive Care Clinical Indicators–ACHS clinical indicator users’ manual. 2011. https://www.safetyandquality.gov.au/sites/default/files/migrated/INTENSIVE_CARE_2011.pdf.
- 6NHSN C. Patient Safety Component. https://www.cdc.gov/nhsn/psc/index.html. Accessed 20 Dec 2025.
- 7WHO. IPC/AMR/patient safety. https://www.who.int/publications/i/item/9789241549929. Accessed 20 Dec 2025.
- 8Medicine Io MU Ro E-B. The learning healthcare system: workshop summary. 2007.21452449 · pubmed ↗
