Assessing the accuracy and educational value of YouTube videos on a novel regional anesthesia technique (PENG block)
Ebru Aladağ, Muhammed Emin Zora, Cheong Kim, Cheong Kim, Cheong Kim

TL;DR
This study assesses the quality and educational value of YouTube videos about a new regional anesthesia technique called the PENG block.
Contribution
The study introduces a novel evaluation of online educational content for a new medical technique using validated scoring systems.
Findings
Overall video quality was moderate, with procedural and institutional videos scoring higher.
The study highlights the educational potential of YouTube for medical training.
It also emphasizes the need for better quality control in online medical content.
Abstract
The Pericapsular Nerve Group (PENG) block is a novel regional anesthesia technique that provides adequate analgesia while preserving motor function. This cross-sectional study evaluated the quality, reliability, and educational value of YouTube videos on the PENG block. Thirty-six videos were analyzed using validated scoring systems (GQS, JAMA, DISCERN, and modified DISCERN). Overall video quality was moderate, with higher scores observed in procedural and institutional videos. The findings highlight both the educational potential and the need for quality control in online medical content.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth Literacy and Information Accessibility · Music Therapy and Health · Health Education and Validation
Introduction
Effective postoperative pain management is critically essential for accelerating recovery, reducing opioid use and its side effects, and increasing patient satisfaction [1]. In recent years, the popularity of ultrasound-guided fascial plane blocks has increased due to their technical ease of application and their potential for opioid-sparing effects.
The Pericapsular Nerve Group (PENG) block, which targets the extensive anterior capsular innervation of the hip joint, was described in 2018 by Girón-Arango et al. as a novel technique based on the administration of local anesthetic into the fascial plane between the psoas muscle and the superior pubic ramus [2]. The PENG block has gained attention for providing adequate analgesia in surgeries such as total hip arthroplasty and hip fracture, while preserving motor function [3,4]. Randomized controlled trials have demonstrated that the PENG block reduces postoperative pain, decreases analgesic requirements, and facilitates functional recovery [5,6].
In the context of education, the use of video-sharing platforms, such as YouTube, has become increasingly prevalent, particularly among medical students and professionals [7,8]. Due to its wide accessibility, low cost, and capacity to facilitate visual learning, YouTube has become an essential tool in modern medical education [9,10]. However, the uncontrolled nature of the platform may result in the dissemination of misleading or inadequate information through videos [11].
To date, no study has systematically examined the quality, reliability, and educational utility of YouTube videos related to the PENG block. This study aims to evaluate the content of PENG block videos available on YouTube and assess their quality, reliability, and usefulness. We hypothesize that videos related to the PENG block generally possess moderate quality and reliability, but these levels significantly differ according to the source of the video.
Materials and methods
Study design
This study is a cross-sectional content analysis examining the quality, reliability, and usefulness of PENG block videos published on the YouTube platform, owned by Alphabet Inc. Since the study was conducted solely on publicly available YouTube videos, no institutional ethics committee approval was required.
Video selection
Between February 26, 2025, and March 26, 2025, two researchers independently searched the YouTube search engine using the keywords “PENG block” and “pericapsular nerve group block.” Among the 450 videos identified, 122 were deemed relevant to the PENG block. After excluding duplicates and non-English videos, 36 videos meeting the inclusion criteria were included in the study. The inclusion criteria were as follows:
(1)Demonstrating the technique and/or application of the PENG block(2)Being in English.
The exclusion criteria included animations, conference recordings, advertisements, and videos with poor audio or visual quality. The titles, URLs, and fundamental metrics of the videos (views, likes, comments, upload date) were recorded.
Evaluation process
The videos were independently assessed by two anesthesiology specialists, blinded to each other’s evaluations. Four different scales were used in the assessments:
Global Quality Score (GQS): Rates the overall quality and flow of the video on a scale of 1–5; 1–2 = low, 3 = moderate, 4–5 = high quality [12].Journal of the American Medical Association (JAMA) Benchmark: Scores authorship, attribution, disclosure, and currency on a scale of 0–4; 1 = low, 2–3 = moderate, 4 = high accuracy [13].DISCERN: Rates content reliability on a scale of 0–75; 0–39 = poor, 40–59 = moderate, ≥ 60 = good quality [14].Modified DISCERN (mDISCERN): Assesses content reliability on a scale of 0–5; < 3 = poor, 3 = moderate, > 3 = good reliability [12].
Inter-rater agreement was calculated using the Intraclass Correlation Coefficient (ICC) and was found to be greater than 0.96 for all scales, indicating excellent agreement.
Video metrics analysis
The number of views, likes, comments, video duration, time since upload, and Video Power Index (VPI) were recorded for each video. Videos were also categorized by content type (presentation or demonstration) and uploader type (individual, academic, manufacturer, or educator).
Statistical analysis
Data analysis was performed using Jamovi version 2.6.22 (Windows). Data distribution was assessed with the Shapiro–Wilk test and was found to be non-normal; therefore, non-parametric tests were used. Comparisons between groups were made using the Mann–Whitney U test, and relationships between scales were evaluated with Spearman’s correlation coefficient. Statistical significance was set at p < 0.05.
Results
Inter-rater agreement
The accuracy, quality, and reliability of the videos were assessed by two independent reviewers using the GQS, JAMA, DISCERN, and Modified DISCERN scales. Inter-rater consistency was excellent across all scales, with an ICC greater than 0.96 (Table 1).
Table 1: Inter-rater agreement (ICC) ın the evaluation of the videos.
Overall quality, accuracy, and reliability
According to the GQS, 13.9% of the videos were of high quality (scores 4–5), 44.4% were of moderate quality (score 3), and 41.7% were of low quality (scores 1–2). According to the JAMA benchmark, 19.4% exhibited high accuracy, 75.0% moderate accuracy, and 5.6% low accuracy. According to the Modified DISCERN, 25.0% demonstrated good reliability, 36.1% moderate, and 38.9% poor reliability. According to the DISCERN scale, 13.9% of the videos were of good quality, 55.6% were moderate, and 30.6% were of low quality (Table 2).
Table 2: Score distributions and classifications of the videos according to quality, accuracy, and reliability scales (N = 36).
Video metrics
The mean duration of the videos was 10.9 ± 9.16 minutes (median: 7.39), with a mean number of views of 23,766, likes of 230, comments of 8.92, and a Video Power Index (VPI) of 101. The detailed distribution of the video metrics is presented in Table 3.
Table 3: Basic platform metrics of the videos (N = 36).
Distribution by content type and source
Of the videos, 97.2% featured presentation content and 30.6% included application content. Most of the videos were uploaded by individuals (91.7%) and academic (100%) sources, while 97.2% were educational and 22.2% were uploaded by manufacturers.
Comparison of application content
Videos with application content were shorter in duration compared to those without (8.14 min vs. 12.14 min; p = 0.276); however, interaction metrics and some quality scores were significantly higher (Table 4). In particular, VPI (p = 0.008), number of views (p = 0.002), likes (p = 0.009), comments (p = 0.022), and DISCERN score (p = 0.025) were higher in application videos. There was also a significant difference in GQS (p = 0.002), whereas no significant differences were observed for JAMA and Modified DISCERN scores.
Table 4: Metrics and quality/reliability scores of videos with and without application content (N = 36).
Comparison between individual and other sources
Differences between videos from non-individual sources using metrics such as VPI, likes, comments, and quality scores as compared to individual videos were not statistically significant.
Comparison between manufacturer and other sources
Videos uploaded by manufacturers had significantly higher numbers of views (p = 0.021), likes (p = 0.007), comments (p = 0.001), as well as significantly higher DISCERN (p = 0.008), GQS (p = 0.001), and Modified DISCERN (p = 0.014) scores (Table 5). No significant difference in video duration was observed (p = 0.837).
Table 5: metrics and quality/reliability scores of videos from manufacturers and other sources (N = 36).
Correlation analysis
Video duration was positively correlated with JAMA (r = 0.434, p < 0.01), Modified DISCERN (r = 0.422, p < 0.05), and DISCERN (r = 0.340, p < 0.05) scores. Engagement metrics (views, likes, comments) showed strong positive correlations with each other (r > 0.8, p < 0.001). There was also a robust correlation between DISCERN and GQS (r = 0.882, p < 0.01; Table 6).
Table 6: Correlations between video metrics and quality/reliability scores (Spearman’s Rho, N = 36).
Discussion
In our study, the overall information quality and reliability of the evaluated YouTube videos were found to be moderate. This finding is consistent with previous studies reporting that health-related content on YouTube is generally low-to-moderate or suboptimal in quality [15,16]. For example, a comprehensive analysis of chemotherapy-related YouTube videos reported an average quality of only “moderate-to-low” [15]. Similarly, more than half of the videos on testicular cancer were found to be of poor quality, and only a few met high scientific standards [16]. These findings suggest that while many health videos on YouTube meet basic accuracy criteria, they often contain superficial or incomplete information. Indeed, the study by Loeb et al. revealed that 77% of popular prostate cancer videos included misleading or biased information [17]. This highlights that content with high views and likes on YouTube is not necessarily reliable. Therefore, health professionals should remain vigilant regarding the information patients obtain from YouTube and intervene to correct potential misunderstandings when necessary.
The type of content and presentation format play an essential role in both engagement and quality. In our study, videos with application (demonstration) content received significantly more views, likes, and comments compared to presentation-only videos, and also exhibited higher average quality and reliability scores. This suggests that visual content demonstrating practical applications is more engaging and educational for viewers. The literature also supports that patient education videos with visual and demonstrative elements achieve higher engagement than other categories. For example, patient information videos—comprising the majority of chemotherapy-related YouTube content—received significantly more views and likes (p < 0.05) than technical videos intended for healthcare professionals, and their quality scores were generally moderate to high [15]. This suggests that well-structured, application-based content can both reach a broader audience and achieve satisfactory content quality.
Nevertheless, popularity alone should not be considered a guarantee of quality. Some studies have shown that longer and more detailed videos are associated with significantly higher quality scores [18]. This suggests that comprehensive presentations add more value to viewers, but they must still be designed to remain engaging and understandable, even when lengthy.
The source of the video (i.e., the uploader type) is another key determinant of content reliability. In our study, videos uploaded by manufacturers or institutional sources demonstrated higher average quality and reliability scores than those uploaded by individual users. This indicates that videos produced by official organizations or industry may be prepared with greater rigor. Indeed, previous studies have shown that videos uploaded by healthcare professionals, universities, or hospitals score significantly higher in quality compared to those prepared by independent individuals. For example, Duran et al. reported that testicular cancer videos uploaded by urologists and academic institutions had significantly higher DISCERN, JAMA, and GQS scores than those from other sources [16]. Similarly, a study of medical videos on YouTube found that content produced by academic institutions achieved the highest reliability scores. However, some commercial company channels also scored above average on specific measures, such as HONcode criteria [17]. Notably, in our study, no statistically significant difference in quality was observed between individual and institutional uploaders. This suggests that while institutional videos tend to have higher scores on average, expert individual content creators can also deliver high-quality information comparable to institutional sources [19]. In other words, individual physicians or specialists can also produce reliable, widely viewed content on platforms like YouTube. Nevertheless, it should not be overlooked that source reliability can be inconsistent; as Loeb et al. observed, even among the most viewed videos, an inverse relationship may exist between view count and scientific quality [17]. Therefore, viewers should pay attention to the expertise of the video source and verify the information presented through independent references.
Another noteworthy finding in our evaluation is the consistency between the assessment scales used. Positive and strong correlations were observed among DISCERN, GQS, and JAMA scores, particularly a very strong correlation between DISCERN and GQS (e.g., r ≈ 0.88). This indicates that different assessment tools yield consistent results regarding the overall quality of a video, meaning that high-quality content scores well across all measures regardless of the specific scale used. In the literature, DISCERN and GQS scores have been reported to correlate closely in medical YouTube videos; for instance, one study reported a significant positive correlation between these two scores [20]. In our research, the fact that the scales produced mutually confirming results suggests that our evaluation method was reliable and presented a consistent picture of video quality classification. In future research, though using a single scale may indicate overall quality, using multiple scales would enhance reliability by providing cross-validation.
In light of these findings, several important implications for the future of health communication on YouTube can be drawn. First, YouTube has great potential in the health domain; when used appropriately, the platform can serve as a powerful tool for educating and informing the public. Indeed, a recent study emphasized that videos on myocardial infarction on YouTube provided consistent, high-quality information, suggesting that YouTube can contribute to raising awareness and facilitating early intervention in this critical health issue [21]. However, current fluctuations in content quality and the risk of misinformation may limit this potential. Therefore, both platform administrators and health authorities have essential responsibilities in improving content oversight and quality. To ensure the delivery of reliable health information on YouTube and similar platforms, several strategies can be proposed:
(1)Healthcare professionals and academic institutions should be encouraged to take a more active role in producing evidence-based content on digital platforms. Educational videos prepared by experts in an understandable language, incorporating demonstrative elements, can combine high quality with audience engagement.(2)The publication of videos according to established standards should be promoted. For example, applying internationally recognized HONcode principles or checklists developed for medical content production can improve quality. For surgical videos, the use of guidelines such as LAP-VEGaS has been recommended to enhance educational value [22].(3)At the platform level, steps should be taken to make high-reliability content more visible and to identify and flag misleading content. YouTube’s collaboration with healthcare organizations to grant verified badges to trustworthy channels and to improve its search algorithm to prioritize reliable sources is valuable in this regard.(4)Finally, efforts to improve patients’ digital health literacy should be intensified. Teaching viewers how to critically evaluate online video content (e.g., verifying sources, recency, and level of evidence) can help mitigate the impact of misinformation.
The findings of this study highlight the importance of striking a balance between optimism and caution in online health communication. While the current moderate quality leaves room for improvement, the success of demonstrative and expert-sourced content suggests that the educational potential of the platform can be enhanced with the right strategies. Future research should examine the direct effects of YouTube health videos on patients’ clinical decision-making, treatment adherence, and health outcomes [23]. In doing so, it will become clearer which types of content truly provide benefit, and content creators can be guided accordingly. In conclusion, in the digital age, platforms like YouTube play a central role in disseminating health information globally; ensuring this role develops positively requires guaranteeing access to accurate information. Strengthened collaboration between health authorities, content creators, and platform providers will likely facilitate the future development of higher-quality, correct, and reliable health communication on YouTube.
Conclusion
This study revealed that health-related videos on YouTube generally offer moderate quality and reliability but exhibit significant differences depending on content type and source. Videos with application and visually rich content had higher views and like counts as well as better quality scores. In contrast, videos uploaded by academic institutions and institutional sources provided more reliable content compared to those uploaded by individual users. Moreover, the strong agreement among quality assessment tools such as DISCERN, GQS, and JAMA supports their reliability as evaluation measures.
These findings highlight both the potential of YouTube for health education and awareness, as well as the risks associated with variability and misinformation. Future research should assess the direct impact of YouTube health videos on patients’ clinical decision-making, treatment adherence, and health outcomes, as well as scientifically determine which content types are genuinely beneficial. This would enable both content creators and health authorities to adopt more strategic approaches to addressing health issues.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Steinhaus ME, Rosneck J, Ahmad CS, Lynch TS. Outcomes After Peripheral Nerve Block in Hip Arthroscopy. Am J Orthop (Belle Mead NJ). 2018;47(6). doi: 10.12788/ajo.2018.0049 29979805 · doi ↗ · pubmed ↗
- 2Girón-Arango L, Peng PWH, Chin KJ, Brull R, Perlas A. Pericapsular Nerve Group (PENG) Block for Hip Fracture. Reg Anesth Pain Med. 2018;43(8):859–63. doi: 10.1097/AAP.0000000000000847 30063657 · doi ↗ · pubmed ↗
- 3Morrison C, Brown B, Lin D-Y, Jaarsma R, Kroon H. Analgesia and anesthesia using the pericapsular nerve group block in hip surgery and hip fracture: a scoping review. Reg Anesth Pain Med. 2021;46(2):169–75. doi: 10.1136/rapm-2020-101826 33109730 · doi ↗ · pubmed ↗
- 4Karaoğlan M, Küçükçay Karaoğlan B. PENG for chronic pain: the clinical effectiveness of pericapsular nerve group block in chronic hip pain. Hip Int. 2024;34(4):524–36. doi: 10.1177/11207000241227542 38380579 · doi ↗ · pubmed ↗
- 5Et T, Korkusuz M. Comparison of the pericapsular nerve group block with the intra-articular and quadratus lumborum blocks in primary total hip arthroplasty: a randomized controlled trial. Korean J Anesthesiol. 2023;76(6):575–85. doi: 10.4097/kja.23064 37013389 PMC 10718628 · doi ↗ · pubmed ↗
- 6Kukreja P, Uppal V, Kofskey AM, Feinstein J, Northern T, Davis C, et al. Quality of recovery after pericapsular nerve group (PENG) block for primary total hip arthroplasty under spinal anaesthesia: a randomised controlled observer-blinded trial. Br J Anaesth. 2023;130(6):773–9. doi: 10.1016/j.bja.2023.02.017 36964012 · doi ↗ · pubmed ↗
- 7Burke S, Snyder S, Rager R. An Assessment of Faculty Usage of You Tube as a Teaching Resource. IJAHSP. 2009. doi: 10.46743/1540-580x/2009.1227 · doi ↗
- 8Duncan I, Yarwood-Ross L, Haigh C. You Tube as a source of clinical skills education. Nurse Educ Today. 2013;33(12):1576–80. doi: 10.1016/j.nedt.2012.12.013 23332710 · doi ↗ · pubmed ↗
