Validity and reliability of the Critical-Care Pain Observation Tool (CPOT) for critically ill pediatric patients

Haruhiko Hoshino; Mitsuki Ikeda; Yujiro Matsuishi; Yuki Enomoto; Nobutake Shimojo; Misaki Kotani; Shunsuke Kobayashi; Takahiro Kido; Satomi Hayashi; Yoko Furuya; Yoshiaki Inoue

PMC · DOI: 10.1371/journal.pone.0320373 · April 18, 2025

TL;DR

This study evaluates the Critical-Care Pain Observation Tool (CPOT) for assessing pain in critically ill pediatric patients and finds it to be reliable and valid.

Contribution

The study confirms the validity and reliability of CPOT for use in critically ill pediatric patients.

Findings

1. CPOT correlated strongly with the VAS obs and FLACC scales in pain assessment.

2. CPOT showed high sensitivity and specificity at a cut-off score of 3.

3. Inter-rater reliability of CPOT was high among research nurses.

Abstract

In some regions, critically ill pediatric and adult patients are cared for in the same intensive care unit, complicating pain assessment due to mixed age groups. To address this, it is essential to use pain scales that are applicable to a wide age range. The Critical-Care Pain Observation Tool (CPOT) was developed to assess pain in both intubated and non-intubated adult patients. However, its applicability in pediatric patients has not been confirmed. The purpose of this study was to evaluate CPOT for critically ill pediatric patients. We conducted a prospective observational study in an eight-bed open PICU from January 2022 to March 2023. Three research nurses independently assessed pain using CPOT, the Face, Legs, Activity, Cry, Consolability (FLACC) scale, and an Observational Visual Analog Scale (VAS obs). Criterion-related and construct validity were examined using Spearman’s rank…

Figures

Fig 1: Receiver Operating Characteristic (ROC) curves for the Critical-Care Pain Observation Tool (CPOT) in critically ill pediatric patients, with different CPOT cut-off scores compared against the FLACC scale score of 4 or higher as the pain criterion. The blue line represents CPOT ≥ 2 (AUC = 0.95), the yellow line represents CPOT ≥ 3 (AUC = 0.98), and the green line represents CPOT ≥ 4 (AUC = 0.89). The optimal CPOT cut-off score was identified as 3, with a sensitivity of 100% and a specificity of 96.7%.

Funding

Japan Society for Aeronautical and Space Sciences. “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.”

Full text

Introduction

Managing pain in pediatric patients in the Pediatric Intensive Care Unit (PICU) presents complex challenges that are influenced by various factors. Critically ill children in the PICU frequently undergo invasive procedures that are inherently painful, necessitating meticulous pain management approaches [1]. One of the primary challenges is the accurate assessment of pain, which is particularly problematic in critical care environments [2]. Various assessment tools and scales have been validated to evaluate pain levels in critically ill pediatric patients, including the Face, Legs, Activity, Cry, Consolability (FLACC) scale and COMFORT-B [3,4]. Current guidelines also recommend these tools for pediatric patients [5].

However, not all critically ill pediatric patients are treated in dedicated PICUs; in some facilities, both in Japan and elsewhere, pediatric and adult patients share the same unit [6–8]. Caring for critically ill pediatric patients requires specialized skills. In these mixed-age units, the need to appropriately assess patients of all ages can increase the workload of medical staff and potentially lead to inadequate pain assessment. Age-inclusive pain scales are therefore essential for effective management. The Behavioral Pain Scale (BPS) has been evaluated in pediatric patients [9]. However, as this tool is primarily designed for intubated populations, its use in non-intubated children is limited, highlighting the need for more versatile pain assessment tools.

The Critical-Care Pain Observation Tool (CPOT) was originally validated in adult patients [10]. The tool evaluates four specific criteria that could conceivably be applicable to pediatric patients: facial expression, body movements, muscle tension, and either compliance with the ventilator (for intubated patients) or vocalization (for extubated patients). Despite its widespread use in adult ICUs, its applicability in critically ill pediatric patients remains underexplored [11].

This study aimed to evaluate the validity and reliability of the CPOT in pediatric patients in a critical care setting.

Methods

Study design and patients

This study was a prospective observational research project conducted between January 2022 and March 2023. The subjects were pediatric patients admitted to the eight-bed open PICU at the University of Tsukuba Hospital. Patients with central nervous system diseases, a history of neurological conditions such as hypoxia and cerebral palsy, and those receiving muscle relaxants were excluded from this study because of the potential difficulty in accurately assessing pain levels. Demographic and clinical data were collected from participants. A detailed explanation of the study was provided both verbally and in writing. As all participants in this study were minors, written informed consent was obtained from their parents or guardians. This consent process was documented, and written records were retained by the research team. In accordance with the ethical standards of the responsible committee on human experimentation (institutional or regional) and the 2013 version of the Helsinki Declaration, this study was approved by the Ethics Committee of the University of Tsukuba Hospital on August 8, 2016 (approval #H28-085, study title: The Study on Delirium, Pain, Sedation, and Withdrawal Symptoms in Pediatric Intensive Care Unit). As additional tools, including the CPOT, were incorporated into the study, amendments were submitted to the ethics committee and subsequently approved. All procedures performed in this study involving human participants complied with the ethical standards of the institution.

Procedure

To assess the validity and reliability of the CPOT in measuring children’s pain, a team comprising three research nurses independently, blindly, and simultaneously employed three different pain measurement scales: CPOT, FLACC scale, and the observational Visual Analog Scale (VAS obs). To minimize potential bias, the VAS obs was administered prior to the CPOT and FLACC assessments. Multiple raters were employed to enhance inter-rater reliability testing.

Pain scales

The CPOT was designed to assess pain in critically ill adult patients who are unable to communicate verbally due to conditions such as sedation, intubation, or their underlying diagnosis [10]. CPOT consists of four distinct categories: facial expression, body movements, compliance with the ventilator (intubated patients) or vocalization (extubated patients), and muscle tension. Each category is rated on a three-point scale (0–2), giving a total score of 0 to 8. A back-translated Japanese version of the CPOT was used in this study [12]. The FLACC scale is a behavioral assessment tool designed to quantify postoperative pain intensity in children [4]. Each of its five categories (face, legs, activity, cry, and consolability) is rated on a scale from 0 to 2, resulting in a total score ranging from 0 (no pain) to 10 (severe pain), with a threshold value of 4. This scale is also recommended by guidelines as a pain assessment tool for critically ill patients [5]. Our previous research has confirmed the validity and reliability of the Japanese version of the FLACC scale [13]. The FLACC scale is the approved tool currently employed to evaluate pain in the PICU of our hospital. The visual analog scale (VAS), first introduced in 1921, is a unidimensional tool for quantifying subjective pain intensity using a 10 cm line labeled from “no pain” to “worst imaginable pain” [14]. However, since younger pediatric patients cannot self-report their pain, we utilized the observational VAS (VAS obs) instead. The VAS obs is a unidimensional tool in which healthcare professionals assess pain based on observed patient symptoms, using a 10 cm line ranging from “no pain” to “worst imaginable pain,” as in the VAS. The use of VAS obs for assessing the validity of pain scales in neonates and children has been reported in previous studies [15,16].
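The scoring rules described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic (four CPOT items rated 0–2 for a total of 0–8, five FLACC items rated 0–2 for a total of 0–10, and a FLACC threshold of 4). The item keys and function names are hypothetical labels chosen for this sketch and are not taken from the published tools.

```python
from typing import Mapping, Tuple

# Hypothetical item keys; the categories themselves are those listed in the text.
CPOT_ITEMS = ("facial_expression", "body_movements",
              "ventilator_or_vocalization", "muscle_tension")
FLACC_ITEMS = ("face", "legs", "activity", "cry", "consolability")


def total_score(ratings: Mapping[str, int], items: Tuple[str, ...]) -> int:
    """Sum the item ratings after checking that every item is rated 0, 1, or 2."""
    missing = [name for name in items if name not in ratings]
    if missing:
        raise ValueError(f"missing items: {missing}")
    for name in items:
        if ratings[name] not in (0, 1, 2):
            raise ValueError(f"{name} must be 0, 1, or 2, got {ratings[name]!r}")
    return sum(ratings[name] for name in items)


def cpot_total(ratings: Mapping[str, int]) -> int:
    """CPOT total, ranging from 0 to 8."""
    return total_score(ratings, CPOT_ITEMS)


def flacc_total(ratings: Mapping[str, int]) -> int:
    """FLACC total, ranging from 0 to 10."""
    return total_score(ratings, FLACC_ITEMS)


def flacc_indicates_pain(ratings: Mapping[str, int], threshold: int = 4) -> bool:
    """Apply the FLACC threshold of 4 mentioned in the text."""
    return flacc_total(ratings) >= threshold


# Made-up observation, for illustration only (not study data).
example_cpot = {"facial_expression": 1, "body_movements": 1,
                "ventilator_or_vocalization": 0, "muscle_tension": 1}
example_flacc = {"face": 1, "legs": 1, "activity": 1, "cry": 1, "consolability": 0}
```

Validating each item before summing keeps an out-of-range rating from silently producing a plausible-looking total.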

Sample size

The sample size was calculated for Spearman’s rank correlation coefficient (ρ). Because the required number of participants decreases as ρ increases (e.g., 0.7 = 18, 0.6 = 24, 0.5 = 33, 0.4 = 51), we chose ρ = 0.3 as the minimal threshold for detecting a “low” correlation. According to previous definitions, |ρ| < 0.3 indicates a negligible correlation, whereas 0.3 ≤ |ρ| < 0.5 represents a low correlation [17]. We aimed to detect this threshold at a significance level (α) of 0.05 and a power (1−β) of 0.8. From these calculations, we determined that we needed at least 89 participants.
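The text does not state which formula produced these numbers, so the following is only a sketch of one common approach: the Fisher z approximation for a two-sided test of a correlation, with an optional variance inflation of (1 + ρ²/2) that is sometimes applied for Spearman’s ρ. The function name and the `spearman_adjust` flag are assumptions made for this illustration.

```python
import math
from statistics import NormalDist


def n_for_correlation(rho: float, alpha: float = 0.05, power: float = 0.80,
                      spearman_adjust: bool = False) -> int:
    """Approximate sample size to detect a correlation of size rho (two-sided test).

    Uses the Fisher z approximation: n = 3 + c * ((z_alpha + z_beta) / atanh(rho))^2,
    where c = 1 for Pearson's r and, if spearman_adjust is True, c = 1 + rho^2 / 2
    (an assumed adjustment for Spearman's rho; the source does not name its method).
    """
    if not 0 < abs(rho) < 1:
        raise ValueError("rho must satisfy 0 < |rho| < 1")
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    base = ((z_alpha + z_beta) / math.atanh(abs(rho))) ** 2
    if spearman_adjust:
        base *= 1 + rho ** 2 / 2
    return math.ceil(base + 3)


n_pearson = n_for_correlation(0.3)                         # plain Fisher z
n_spearman = n_for_correlation(0.3, spearman_adjust=True)  # with the assumed adjustment
```

Under these assumptions, ρ = 0.3 with α = 0.05 and power 0.8 gives 85 without the adjustment and 89 with it, which agrees with the reported minimum of 89. The same sketch does not reproduce every example value listed in the text (for instance, it gives 16 rather than 18 for ρ = 0.7), so it should not be read as the authors’ exact procedure.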

Statistical analysis

Patient characteristics, including demographics such as age, sex, mortality risk, intubation status, and diagnoses, were obtained from clinical charts. Criterion-related validity of the CPOT was assessed through its correlation with the VAS obs, while construct validity was evaluated via its correlation with the FLACC scale, using Spearman’s rank correlation coefficient (ρ). Receiver Operating Characteristic (ROC) curve analysis was used to determine the diagnostic performance of the CPOT, with a FLACC score of 4 or higher as the threshold. The Mann–Whitney U-test was used to compare differences in CPOT scores during medical interventions, such as intravenous catheterization, suctioning, and bathing. The inter-rater reliability for CPOT scores was assessed using Cohen’s weighted κ.
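As a rough illustration of the validity analysis, Spearman’s ρ can be computed as the Pearson correlation of average ranks, which handles the ties that are common in short ordinal scales such as CPOT and FLACC. The sketch below uses only the Python standard library; the function names and the example scores are invented for illustration and are not study data or the authors’ code.

```python
from statistics import mean
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Return 1-based ranks, giving tied values the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / (sxx * syy) ** 0.5


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rank correlation: Pearson correlation of the average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    return pearson(average_ranks(x), average_ranks(y))


# Made-up paired scores for illustration only.
toy_cpot = [0, 1, 1, 3, 5, 2]
toy_flacc = [0, 2, 1, 4, 7, 3]
toy_rho = spearman_rho(toy_cpot, toy_flacc)
```

In practice a library routine such as SciPy’s `spearmanr` would normally be preferred; the hand-written version is shown only to make the rank-then-correlate step explicit.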

Data analysis was conducted using Python programming language (version 3.8) and EZR (Saitama Medical Center, Jichi Medical University, Saitama, Japan), a graphical user interface for R (The R Foundation for Statistical Computing, Vienna, Austria) [18]. All statistical tests were two-sided, and a p-value of less than 0.05 was considered statistically significant. Bootstrap methods with 1,000 resamples were used to calculate 95% confidence intervals (CIs). Data manipulation was performed using the pandas library (version 1.2.3), and statistical computations were executed using the scikit-learn library (version 0.24.1).
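The text mentions bootstrap confidence intervals with 1,000 resamples but does not say which bootstrap variant was used. The sketch below shows the simplest one, a percentile bootstrap over paired observations, using only the standard library. The function names, the fixed seed, and the example statistic are assumptions made for this illustration.

```python
import random
from typing import Callable, List, Sequence, Tuple

Pair = Tuple[float, float]


def bootstrap_ci(pairs: Sequence[Pair],
                 statistic: Callable[[Sequence[Pair]], float],
                 n_resamples: int = 1000,
                 level: float = 0.95,
                 seed: int = 0) -> Tuple[float, float]:
    """Percentile bootstrap CI: resample pairs with replacement, recompute the statistic."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(pairs)
    estimates: List[float] = []
    for _ in range(n_resamples):
        sample = [pairs[rng.randrange(n)] for _ in range(n)]
        estimates.append(statistic(sample))
    estimates.sort()
    lower_index = max(0, int((1 - level) / 2 * n_resamples))
    upper_index = min(n_resamples - 1, int((1 + level) / 2 * n_resamples) - 1)
    return estimates[lower_index], estimates[upper_index]


def mean_difference(sample: Sequence[Pair]) -> float:
    """Example statistic: mean of (first score minus second score)."""
    return sum(a - b for a, b in sample) / len(sample)


# Made-up paired scores for illustration only.
toy_pairs = [(0, 0), (1, 2), (3, 4), (5, 7), (2, 3), (4, 6)]
ci_low, ci_high = bootstrap_ci(toy_pairs, mean_difference)
```

Any statistic that accepts a list of pairs, such as a correlation coefficient or a κ value, could be passed in place of `mean_difference`. Because the study recorded up to five observations per patient, resampling whole patients rather than single observations may be more appropriate; the source does not say which unit was resampled.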

Results

In total, 135 patients were enrolled in this study. After excluding 20 patients with central nervous system diseases and an additional 24 patients with a history of neurological conditions, 91 pediatric patients were included in the study (Table 1). As shown in Table 2, these 91 patients were observed a total of 165 times (each subject was evaluated between one and five times). The median age of the included patients was 13 months (range: 0–214 months).

Table 1: Characteristics and diagnosis of study subjects.

Table 2: Characteristics and diagnosis of assessments.

Validity

As shown in Table 3, criterion-related validity was assessed using Spearman’s rank correlation coefficient (ρ) between CPOT and VAS obs scores, and construct validity was examined using ρ between CPOT and FLACC scores. The correlation coefficients were high, with all values exceeding 0.7, indicating a strong and significant relationship (p < 0.01). This finding supports the validity of CPOT across the patient groups studied.

Table 3: Spearman rho correlation between CPOT and VAS obs and between CPOT and FLACC.

The optimal cut-off score for CPOT was 3 or higher (≥ 3), as demonstrated by the Receiver Operating Characteristic (ROC) curve (Fig 1). The AUC (Area Under the Curve), sensitivity, specificity, PPV (Positive Predictive Value), and NPV (Negative Predictive Value) at cut-off scores of ≥ 2, ≥ 3, and ≥ 4 are shown in Table 4. In particular, the CPOT cut-off of ≥ 3 yielded excellent diagnostic performance, with an AUC of 0.98, sensitivity of 100%, and specificity of 96.7%. The positive likelihood ratio was 30.2 and the negative likelihood ratio was less than 0.01 at this threshold, highlighting the clinical significance of this cut-off score. CPOT scores were significantly higher in patients who were undergoing medical interventions than in those who were not (p < 0.01).
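The relationship between these figures can be checked with a short sketch. It dichotomizes CPOT at a cut-off, uses FLACC ≥ 4 as the reference for pain, and derives sensitivity, specificity, and likelihood ratios. It also uses the fact that the AUC of a single binary cut-off equals (sensitivity + specificity) / 2. The source does not describe how the per-cut-off AUC was computed, so that last step is an assumption, and the function names and example scores are invented for illustration.

```python
import math
from typing import Sequence, Tuple


def confusion_counts(cpot_scores: Sequence[int], flacc_scores: Sequence[int],
                     cpot_cutoff: int = 3,
                     flacc_threshold: int = 4) -> Tuple[int, int, int, int]:
    """Return (TP, FP, FN, TN) with FLACC >= flacc_threshold as the reference for pain."""
    if len(cpot_scores) != len(flacc_scores):
        raise ValueError("score lists must have equal length")
    tp = fp = fn = tn = 0
    for cpot, flacc in zip(cpot_scores, flacc_scores):
        predicted = cpot >= cpot_cutoff
        actual = flacc >= flacc_threshold
        if predicted and actual:
            tp += 1
        elif predicted:
            fp += 1
        elif actual:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def sensitivity_specificity(tp: int, fp: int, fn: int, tn: int) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative reference case")
    return tp / (tp + fn), tn / (tn + fp)


def likelihood_ratios(sensitivity: float, specificity: float) -> Tuple[float, float]:
    """LR+ = sens / (1 - spec); LR- = (1 - sens) / spec."""
    lr_pos = math.inf if specificity == 1 else sensitivity / (1 - specificity)
    lr_neg = math.inf if specificity == 0 else (1 - sensitivity) / specificity
    return lr_pos, lr_neg


def single_cutoff_auc(sensitivity: float, specificity: float) -> float:
    """Area under the ROC curve of one binary cut-off (equal to balanced accuracy)."""
    return (sensitivity + specificity) / 2


# Rounded values reported in the text for CPOT >= 3.
lr_pos, lr_neg = likelihood_ratios(1.0, 0.967)
auc_at_3 = single_cutoff_auc(1.0, 0.967)

# Made-up scores for illustration only (not study data).
toy_counts = confusion_counts([0, 1, 3, 5, 3, 4], [0, 2, 4, 7, 3, 6])
```

With the rounded sensitivity (1.00) and specificity (0.967), this gives a positive likelihood ratio of about 30.3, a negative likelihood ratio of 0, and a single-cut-off AUC of about 0.98. These values are close to the reported 30.2, < 0.01, and 0.98; the small difference in the positive likelihood ratio is plausibly due to rounding of the specificity.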

Table 4: The AUC, sensitivity, specificity, positive likelihood ratio, and negative likelihood ratio for each CPOT score were calculated based on a FLACC score of 4 or higher.

Reliability

The inter-rater reliability of the CPOT scores across observers, assessed by two nurses and expressed as weighted κ, was greater than 0.8, as shown in Table 5, indicating strong agreement between nurses in evaluating pain using the CPOT scale. Weighted κ values ranged from 0.74 to 1.0 across different patient subgroups, further confirming the reliability of CPOT in this pediatric critical care setting.
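For readers unfamiliar with the statistic, Cohen’s weighted κ for two raters can be sketched as one minus the ratio of weighted observed disagreement to weighted chance-expected disagreement. The text does not say whether linear or quadratic weights were used, so the `weights` parameter below is an assumption, as are the function name and the example ratings.

```python
from collections import Counter
from typing import Hashable, Sequence


def weighted_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable],
                   categories: Sequence[Hashable], weights: str = "linear") -> float:
    """Cohen's weighted kappa for two raters over ordered categories.

    Disagreement weights are |i - j| ("linear") or (i - j)^2 ("quadratic"),
    where i and j are the positions of the two ratings in `categories`.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("need two non-empty rating lists of equal length")
    if weights not in ("linear", "quadratic"):
        raise ValueError("weights must be 'linear' or 'quadratic'")
    power = 1 if weights == "linear" else 2
    index = {category: i for i, category in enumerate(categories)}
    n = len(rater_a)
    observed = Counter(zip(rater_a, rater_b))
    margin_a = Counter(rater_a)
    margin_b = Counter(rater_b)
    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for ci in categories:
        for cj in categories:
            weight = abs(index[ci] - index[cj]) ** power
            observed_disagreement += weight * observed[(ci, cj)] / n
            expected_disagreement += weight * margin_a[ci] * margin_b[cj] / n ** 2
    if expected_disagreement == 0:
        raise ValueError("kappa is undefined when both raters use one shared category")
    return 1 - observed_disagreement / expected_disagreement


# Made-up CPOT totals (0-8) from two raters, for illustration only.
cpot_categories = list(range(9))
nurse_1 = [0, 1, 3, 5, 2, 4, 0, 6]
nurse_2 = [0, 1, 3, 4, 2, 4, 1, 6]
toy_kappa = weighted_kappa(nurse_1, nurse_2, cpot_categories)
```

Weighting matters for an ordinal total such as CPOT because a one-point disagreement is penalized less than a four-point disagreement, whereas unweighted κ treats both as equally wrong.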

Table 5: Measures of agreement between the observers.

Discussion

We evaluated the validity and reliability of the CPOT in critically ill pediatric patients. CPOT was originally developed to measure pain in critically ill adult patients who were unable to communicate verbally because of their diagnosis, sedation, or intubation [10]. Given its widespread international adoption and high regard for its validity and reliability in adult patient populations [11], we investigated whether this tool, initially conceived for adults, could also be applied in pediatric settings. The results of our study indicate that CPOT items were observable in pediatric patients, providing new insights into its potential application and suggesting that it can be adapted effectively to younger populations.

In this study, CPOT scores exhibited a high correlation with VAS obs scores, as evaluated by the attending nurses. The VAS obs is a unidimensional tool designed to quantify pain intensity, utilizing a 10 cm line with endpoints labeled ‘no pain’ and ‘worst imaginable pain.’ This scale is widely used to compare pain between patients and to monitor the course of pain in individual patients. Previous research has employed the VAS obs to evaluate the validity of pain scales in pediatric populations [19,20]. Additionally, CPOT scores were highly correlated with the FLACC scale. This is particularly significant because the FLACC scale is frequently used to assess critically ill pediatric patients [21]. Our research team has extensive experience with the FLACC scale, having previously validated its Japanese version [13]. Moreover, the strong correlations and the significant difference in CPOT scores during medical interventions collectively underline the scale’s capacity to capture fluctuations in pediatric pain states.

In addition, when the FLACC scale was used as the benchmark for pain assessment, a CPOT cut-off score of 3 or higher yielded the highest AUC. This differs from the original CPOT, which typically uses a cut-off score of 2 (on a total score ranging from 0 to 8 across four categories). However, the cut-off value may vary depending on the population and situation, and past reports have shown cut-off values ranging from 2 to 3 in adults [22]. This suggests that multiple cut-off values are likely to be reported for pediatric populations in future studies.

The higher the value of weighted kappa, the stronger the agreement, with values ranging from 0 to 1. Values between 0.81 and 1.00 indicate “almost perfect” agreement, whereas those between 0.61 and 0.80 are considered “substantial” [23]. In our study, weighted kappa values for CPOT, including all observations and subgroups, fell within these ranges, demonstrating high levels of inter-rater reliability. The four dimensions of the CPOT demonstrated moderate to excellent inter-rater reliability. However, for ‘body movement,’ the agreement was only ‘substantial,’ likely because of differing assessments of subtle movements in children. Compared with adults, children tend to move more restlessly, and this characteristic may have impacted the consistency of the evaluations. Therefore, special attention may be needed when assessing ‘body movement’ in pediatric patients. In prior research, the agreement among nurses for scoring CPOT has also been reported to be ‘moderate’ [24,25]. Considering these results, the inter-rater reliability obtained in our study supports the reliability of CPOT in pediatric patients.

Pain assessment is limited in some areas [26–28], and we believe that one of the reasons is the inconvenience of switching between assessment tools depending on each patient’s age. Implementing CPOT in mixed-age intensive care units could address this issue by allowing both pediatric and adult patients to be evaluated with a single tool, thereby promoting consistency in pain assessment across different age groups. Furthermore, CPOT’s applicability to both intubated and extubated patients simplifies the measurement process, making it feasible to monitor pain continuously throughout the course of critical care. Such a unified approach may not only streamline clinical workflows but also foster more integrated pain management protocols. Taken together, these advantages underscore the value of exploring CPOT’s broader adoption in settings where patient demographics and ventilation status frequently vary.

This study had several limitations. First, patients with neurological conditions were excluded. Previous studies have used the FLACC and CPOT scales to assess these patient populations [29,30]. The evaluation of pain in this group is very important, and whether CPOT can be used for this purpose needs to be investigated; we are considering an evaluation of this group in future research. Excluding these patients might introduce a selection bias, as neurological impairment can significantly alter pain perception and expression. Second, the assessment data were analyzed only by a single pair of nurses. Therefore, this pair could have unconsciously assigned VAS obs scores that favored the still unestablished CPOT and the FLACC scale. In future studies, it will be important to include a larger number of nurses to confirm continued inter-rater reliability. Third, this study was conducted as an initial validation test in a single ICU with a relatively small number of pediatric patients and limited associated clinical diagnoses. Future trials involving a larger cohort, multiple PICUs, and a wider spectrum of primary clinical diagnoses (e.g., surgeries other than cardiovascular, primary pulmonary disease, acute infection) are required for further validation and foundational data collection. To address these limitations, we recommend a multi-center approach to enhance generalizability, alongside refining the scoring criteria to better account for varied neurological and developmental statuses. Moreover, it is crucial to validate CPOT among patients receiving high-flow nasal cannula oxygen and non-invasive positive pressure ventilation, as these methods of respiratory support are increasingly used.

Overall, this study contributes to the existing literature by offering an initial demonstration that CPOT is both valid and reliable in critically ill pediatric populations, thereby expanding its known scope from adult-only usage. While our findings highlight CPOT’s feasibility and potential to unify pain assessment across age groups, future research will be critical for corroborating and refining these observations through larger and more diverse patient populations.

Conclusions

The results of this study suggest that the CPOT could be a useful tool for pain assessment in pediatric patients. Further studies are urgently required to validate these findings.

Supporting information

S1 File. Assessment. (XLSX)

S2 File. Patient. (XLSX)

Bibliography

1. La Fond CM, Hanrahan KS, Pierce NL, Perkhounkova Y, Laures EL, McCarthy AM. Pain in the pediatric intensive care unit: How and what are we doing? Am J Crit Care. 2019;28(4):265–73. doi: 10.4037/ajcc2019836. PMID: 31263009.
2. Laures E, La Fond C, Hanrahan K, Pierce N, Min H, McCarthy AM. Pain assessment practices in the pediatric intensive care unit. J Pediatr Nurs. 2019;48:55–62. doi: 10.1016/j.pedn.2019.07.005. PMID: 31325800.
3. van Dijk M, de Boer JB, Koot HM, Tibboel D, Passchier J, Duivenvoorden HJ. The reliability and validity of the COMFORT scale as a postoperative pain instrument in 0 to 3-year-old infants. Pain. 2000;84(2–3):367–77. doi: 10.1016/s0304-3959(99)00239-0. PMID: 10666543.
4. Merkel SI, Voepel-Lewis T, Shayevitz JR, Malviya S. The FLACC: a behavioral scale for scoring postoperative pain in young children. Pediatr Nurs. 1997;23(3):293–7. PMID: 9220806.
5. Smith HAB, Besunder JB, Betters KA, Johnson PN, Srinivasan V, Stormorken A, et al. 2022 Society of Critical Care Medicine clinical practice guidelines on prevention and management of pain, agitation, neuromuscular blockade, and delirium in critically ill pediatric patients with consideration of the ICU environment and early mobility. Pediatr Crit Care Med. 2022;23(2):e74–110. doi: 10.1097/PCC.0000000000002873. PMID: 35119438.
6. Pediatric Intensive Care Committee, Japanese Society of Intensive Care Medicine. Survey of PICUs in Japan (in Japanese). Journal of the Japanese Society of Intensive Care Medicine. Nihon Shuchu Chiryo Igakukai zasshi. 2019;26(3):217–25. doi: 10.3918/jsicm.26_217.
7. Biban P. Should critically ill children be managed in paediatric intensive care units only? Intensive Care Med. 2011;37(5):886. doi: 10.1007/s00134-011-2176-6. PMID: 21350905.
8. Christian MD, Kissoon N. Caring for critically ill adults in PICUs is not “child’s play”. Pediatr Crit Care Med. 2020;21(7):679–81. doi: 10.1097/PCC.0000000000002430. PMID: 32420719. PMCID: PMC7331596.