Evaluating fluoride-related YouTube videos in Japan: A comparative analysis of understandability, actionability, and reliability between pro- and anti-fluoride content
Hikari Sophia Nagao, Tsuyoshi Okuhara, Emi Furukawa, Hiroko Okada, Takahiro Kiuchi

TL;DR
This study compared pro- and anti-fluoride YouTube videos in Japan, finding that pro-fluoride videos are more reliable but less engaging, while anti-fluoride videos with clear content attract more views.
Contribution
This is the first study to combine specific quality tools with YouTube analytics for Japanese fluoride-related content.
Findings
Pro-fluoride videos scored higher in reliability and overall quality but had lower audience reach.
Understandable and well-structured anti-fluoride videos achieved higher view rates.
Only 27% of all videos met the understandability threshold set by PEMAT-A/V.
Abstract
This study aimed to compare pro- and anti-fluoride Japanese YouTube videos on understandability, actionability, flow, reliability, and engagement. Eighty-four videos found via a keyword search (pro = 49, anti = 18, other = 17) were analyzed. Quality was assessed using three validated tools—namely, Patient Education Materials Assessment Tool for Audiovisual Materials (PEMAT-A/V), Global Quality Score (GQS), and modified DISCERN (mDISCERN). Engagement was measured as view rate. Pro-fluoride videos scored higher on reliability (mDISCERN 2.6 ± 1.0 vs 1.3 ± 0.6) and overall quality (GQS 2.5 ± 1.4 vs 1.0 ± 0.9; both p < 0.001). No group differences emerged for understandability (58% versus 53%; p = 0.23) or actionability (60% versus 53%; p = 0.33). In the anti-fluoride group, a higher view rate positively correlated with understandability and GQS (ρ ≈ 0.53; p ≈ 0.03); no correlation was…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInorganic Fluorides and Related Compounds · Health Literacy and Information Accessibility · Fluorine in Organic Chemistry
Introduction
1
In recent years, methods of obtaining health-related information have changed substantially. Although such information has traditionally been disseminated primarily through face-to-face interaction, rapid advances in digital technologies have established social media as a key health information source. Today, individuals can readily access information via computers and smartphones, enabling real-time information sharing and exchange irrespective of time or location.
These shifts in the digital landscape have significantly influenced health-related behaviors. Among various platforms, YouTube—the world's largest video-sharing service—has gained attention as an online health information source [1]. As of 2024, YouTube has approximately 2.5 billion monthly active users globally. A similar trend is evident in Japan, where a 2023 survey by the Ministry of Internal Affairs and Communications on media usage time and information behavior found that over 90% of individuals in their teens to forties use YouTube [2]. Moreover, with over 70 million monthly users, YouTube is one of Japan's most frequently used social media platforms [3].
Vlogs, or video blogs, are experience-based videos created by individuals on varied topics and are typically posted on platforms such as YouTube. Vlogs concerning medical and health issues are actively shared. For example, individuals with breast cancer [4] and gestational diabetes [5] have documented their personal experiences through publicly accessible vlogs. These videos draw viewers into their creators' everyday lives; therefore, they tend to achieve high engagement rates, with audiences often continuing to watch subsequent content [6]. Such vlog-based services significantly contribute to shaping patients' experiences in the progression and management of chronic diseases.
Nevertheless, concerns persist regarding the accuracy and reliability of health-related information on YouTube. As anyone with an account can freely upload content, a risk exists that unverified information may circulate without adequate fact-checking [7]. Considering YouTube's widespread influence and ease of access, ensuring its health-related content’ quality and accuracy is imperative.
Prior research has suggested that individuals with low health literacy are more likely to obtain health information through television, social media, blogs, or celebrity websites than their higher-literacy counterparts [8]. However, some of these sources, particularly celebrity websites, blogs, and social media, may provide lower-quality health information than professional sources, such as physicians and dentists [8].
Additionally, studies have indicated that dental caries, like other health disparities, are associated with social determinants and have been referred to as a “canary in the coalmine” [9]. Beyond platform uptake, experimental and review evidence has indicated that presentation design directly impacts comprehension and behavior. Randomized experiments manipulating processing fluency (e.g., typography and layout) have been demonstrated to increase perceived ease, self-efficacy, and behavioral intentions [10]. Meanwhile, systematic reviews have revealed that transparent visual aids reliably enhance risk understanding and informed decision-making across diverse audiences [11,12]. Consequently, ensuring the online availability of accurate, accessible, and actionable health information to all individuals is essential for addressing health disparities.
Population-based studies have suggested that comprehensive health literacy in Japan is heterogeneous and, on average, lower than that in Europe, underscoring the necessity for comprehensible and actionable materials. Validated Japanese instruments (e.g., a Japanese adaptation of the PEMAT) enable the systematic appraisal of these properties in Japanese-language materials. Combined with the characteristics of Japanese orthography (kanji–kana mixing), these factors justify a design-aware approach to evaluating and enhancing online health information in Japan.
Fluoride use is a particularly effective and widely recognized public health intervention for preventing dental caries, a preventable condition [13]. Application methods are generally classified into systemic and topical approaches [14,15]. Community-level water fluoridation is among the most extensively implemented systemic interventions. As of 2012, over 420 million individuals worldwide used fluoridated water, including approximately 50 million who relied on naturally fluoridated sources and approximately 370 million relying on water supplies adjusted to optimal or near-optimal fluoride concentrations [16]. For topical application, fluoride-containing toothpaste, mouth rinses, and professionally applied fluoride treatments in dental clinics are commonly employed at the individual level [14]. In Japan, community water fluoridation has not been implemented as a public health measure, and fluoride is primarily applied at the individual level. In Japan, local governments implement school-based fluoride mouth-rinse programs (S-FMR). Recent studies have reported that geographic variation in naturally occurring fluoride in tap water is associated with lower caries experience among children and that prefectures with broader S-FMR dissemination exhibit lower mean DMFT and smaller inter-prefectural inequalities. However, S-FMR has not been universally adopted across municipalities, and fluoride-based caries prevention remains centered on individual-level behaviors. Thus, for effective caries prevention, individuals must act based on accurate information regarding fluoride use. However, no study has evaluated the understandability and actionability of fluoride-related information readily accessible to the general public.
Fluoride has frequently become the subject of intense debate on social media platforms, where pro- and anti-fluoride perspectives often clash. Opponents of fluoride typically include general users, influencers, and activists lacking dental and scientific expertise. Prior research has revealed that a significant portion of fluoride-related online content is either negative or misleading. An analysis of widely viewed fluoride-related YouTube videos identified an anti-fluoride bias, with the content focusing more on perceived risks than on documented benefits. Among a sample of 100 such videos, most were produced by non-experts, with user-generated content accounting for approximately 69% of total views, suggesting that anti-fluoride or sensational content frequently achieves a greater reach than expert-driven content [17]. On Instagram, anti-fluoride posts often feature emotionally charged messages, such as “Fluoride is poisoning you and your children.” [18]. Posts with strongly negative or sensational tones elicit greater engagement with social media algorithms. Fear-inducing appeals and provocative statements by opponents generate more shares and comments, thereby amplifying the content's reach.
A marked difference exists in both the nature and reliability of online information about fluoride, depending on whether it is disseminated by supporters or opponents, which may influence the public understanding of the topic. A study on the readability of web pages concerning fluoride use found that anti-fluoride websites are generally written in a more accessible style than pro-fluoride websites, potentially making it easier for a wider audience to comprehend, contributing to public confusion on the issue [19]. Pro-fluoride information generally originates from expert sources, with many posts and videos produced by dentists, pediatricians, and public health organizations. However, the reach of these sources may be limited. As previously noted, consumer-generated videos on YouTube receive substantially more views than those produced by experts [17]. Similarly, an analysis of Instagram indicated that most fluoride-related posts are created by non-experts and often achieve high engagement, even when they contain misinformation.
Growing evidence has suggested that online misinformation is associated with fluoride hesitation or refusal in specific communities. For instance, dental practitioners have reported an increasing number of parents declining fluoride application for their children, citing online sources such as “I read it on Facebook” or “I saw it on YouTube” that claim fluoride is harmful [18]. In a study involving interviews with mothers of young children residing in areas with strong anti-fluoride sentiment, two-thirds reported discussing or obtaining fluoride-related information through social networks, including family, friends, other parents, and social media contacts. Numerous mothers indicated that they received conflicting messages. While some sources (e.g., pediatric dentists or scientifically inclined acquaintances) provided reassurance about fluoride's benefits, others (e.g., neighbors or Facebook groups) issued warnings. This inconsistency across multiple sources caused confusion regarding whom to trust and made it difficult for these mothers to discern which fluoride-related advice was most accurate [20]. Moreover, opposition to fluoride use has been identified as a key factor influencing parental refusal of fluoride treatment offered by dental professionals [21]. Studies have also found that parents opposing fluoride often exhibit resistance to vaccinations. Such refusals reportedly contribute to increased disease burden, rising healthcare costs, and preventable illness and suffering among children and their surrounding communities [22].
In the formation of attitudes toward health information, non-logical factors—such as the mode of presentation and perceptions of the information source—are as influential as information reliability or scientific validity. The Elaboration Likelihood Model (ELM) conceptualizes behavior change through two cognitive pathways: the central route, which involves deep processing based on content; and the peripheral route, which involves superficial processing based on emotions or impressions. This framework has been widely applied to evaluate health communication within medical and public health domains [23,24]. Furthermore, individuals tend to imitate others' behaviors or interpret them correctly when faced with uncertainty in their judgment. This psychological phenomenon, referred to as social proof, significantly influences individual decision-making. In the context of health-related topics, platforms such as YouTube function as spaces for peer-based information exchange and support among individuals with similar concerns and impact viewers' knowledge, attitudes, and behaviors.
In recent years, several standardized tools have been developed to assess health information disseminated via social media platforms, such as YouTube; prominent examples include the Patient Education Materials Assessment Tool for Audiovisual Materials (PEMAT-A/V) [25], the Global Quality Score (GQS) [26], and the modified DISCERN (mDISCERN) [27]. These tools enable the systematic evaluation of online health education content's understandability, actionability, and credibility. Previous studies employing these tools have assessed the educational quality and trustworthiness of YouTube videos on various topics, including food poisoning and COVID-19 [28], and within dentistry, on the safety of dental radiography [29] and pediatric oral hygiene instruction [30]. However, to our knowledge, no study has examined YouTube videos on the fluoride use for preventing caries with respect to comprehensibility, practical applicability, scientific reliability, or structural coherence.
Prior YouTube evaluations across telemedicine and urologic oncology have consistently reported low PEMAT A/V scores—particularly for actionability—and only fair DISCERN/GQS performance, with persistent misinformation concerns [31,32]. Similar limitations have been noted for placenta accreta and PDE5 inhibitor content, and mixed quality for prostate cancer mental health videos [[32], [33], [34]]. However, no study has examined Japanese fluoride content while integrating these quality metrics with YouTube analytics.
Previous studies have indicated that factual information alone is insufficient to persuade individuals that fluoride's benefits outweigh its risks. Furthermore, when individuals perceive fluoride as beneficial, their risk perception generally decreases [35]. Therefore, this study aims to compare the quality (understandability, actionability, reliability) and viewer engagement of pro- and anti-fluoride Japanese YouTube videos.
Research question
1.1
Does a difference exist in the degree of understandability, actionability, flow, comprehensiveness, and scientific reliability between pro- and anti-fluoride content shared on social media?
Materials and methods
2
Study design
2.1
We systematically and quantitatively analyzed YouTube videos on fluoride use for caries prevention using standardized health information evaluation tools.
Search strategy
2.2
Video selection and eligibility
2.2.1
On October 15, 2023, we systematically searched YouTube for relevant videos using keyword combinations entered in Japanese text (listed in Supplementary Table 1, with English translations provided)—following methodologies used in previous research analyzing YouTube videos on health-related information [36]. These keywords were identified using Google Trends to capture terms commonly utilized by the general public; we included the top queries from the preceding 12 months to balance recency with sufficient search volume for meaningful trend identification. A comprehensive search was conducted on YouTube using a range of relevant keywords, including “fluoride,” as well as additional terms identified via Google Trends, such as “fluoride AND coating,” “fluoride AND processing,” “dentist AND fluoride,” “fluoride AND coat,” “fluoride AND toothpaste,” “fluoride AND pan coating,” “fluoride AND resin,” “fluoride AND danger,” “fluoride AND rinse,” and “fluoride.” The search results were categorized based on “view count.” The top 50 videos for each keyword were selected for analysis. The search was conducted using Google Chrome in incognito mode to prevent the influence of personal browsing history or tailored search results; the region was set to Japan. After each search, the browser was closed and reopened in incognito mode to minimize personalization. For each search formula, the top 50 results were reviewed on February 10, 2025. Only videos with clear pro- or anti-stances were included, while those categorized as “other” (n = 17) were excluded. The exclusion criteria were as follows: duplicate videos, videos irrelevant to the search term (such as content about Teflon-coated cookware), videos shorter than one minute, non-Japanese language videos, and videos intended for dental hygienists. Videos targeting dental hygienists were excluded because the PEMAT-A/V, GQS, and mDISCERN instruments are designed to evaluate patient-facing educational materials. Including professional-only content would fall outside the intended scope of these tools and could bias assessments of understandability and actionability for the lay public. Additionally, one video was removed because it had been deleted for violating YouTube Community Guidelines. Finally, 84 videos were included. This study analyzed publicly available online content and did not involve human subjects. Institutional review board approval was, therefore, waived. The study was conducted in accordance with institutional guidelines and PRISMA principles adapted to digital media studies. Complete search terms (Japanese and English) are provided in Supplementary Table 1, and video classification criteria are detailed in Appendix A.
Data extraction
2.3
Basic characteristics
2.3.1
The following basic information was collected from each video: number of days since upload, total view count, number of likes, source, intended audience, and content. To assess viewership trends, the number of views was divided by the number of days since upload to calculate the average number of views per day.
We identified each video's source based on the YouTube channel that uploaded it. We categorized it into six groups: dentist/dental hygienist, local government, company, dental association, layperson, and unknown. “Company” referred to dental product manufacturers. “Unknown” was defined as videos with no clear author information. We also identified the intended audience of the videos. We judged them based on the video titles.
Content characteristics
2.4
The first author (HSN) assessed the intended audience for each video and classified them as either the general public or dental hygienists. Moreover, HSN examined video content to determine its stance on fluoride use for caries prevention, categorizing each video as pro, anti, or other based on its central message. To ensure systematic classification, the first author (HSN) categorized each video as pro, anti, or other. This process was guided by a predefined set of criteria (detailed in Appendix A) focusing on the video's central message, narration, and visual elements to minimize subjective judgment. Classification criteria included the video's overall stance as expressed in the narration, subtitles, and visual elements, for subsequent analyses of video characteristics and quality assessment using PEMAT-A/V, GQS, and mDISCERN.
Evaluation methods
2.5
Understandability and actionability
2.5.1
We used the validated Japanese version of PEMAT-A/V to rate two domains—understandability (12 items) and actionability (4 items)—for each video [37]. Items were scored dichotomously (agree = 1; disagree = 0); domain scores were computed as (sum of “agree” / number of applicable items) × 100, excluding items marked not applicable. Following prior applications, we interpreted a score of ≥70% as an acceptable benchmark for both domains [37]. Appendix C concisely summarizes domains, scoring, and thresholds (full item wording in Appendix B).
Natural flow and comprehensiveness
2.5.2
Each video was assessed using the GQS, a five-point Likert scale designed to evaluate the presented information's overall natural flow and comprehensiveness [26]. The GQS is widely used to evaluate health- and medicine-related video content. It comprises a single-item measure with scores ranging from 1 (poor quality) to 5 (excellent quality; Appendix D).
Reliability
2.6
We applied the mDISCERN items using their Japanese translation, which was adapted from the validated Japanese version of the original DISCERN [27]. The mDISCERN index was employed as a scoring system to assess the content's reliability and accuracy, using a five-point scale [27]. The score is based on five criteria: aim, presentation accuracy, balance, reference, and mentioning uncertainty. Higher total scores reflect greater overall reliability (Appendix E).
Statistical analysis
2.7
Two independent reviewers (HSN, a dentist; EF, a health communication professional) independently rated a random sample of 23 of 84 videos (27.4%) using the PEMAT-A/V, GQS, and mDISCERN tools. The sample was selected with a computer-generated random number table in Microsoft Excel. Agreement was assessed at the item level using Cohen's κ and Gwet's AC1, and at the scale level using intraclass correlation coefficients (ICC [1,2]; two-way random effects, absolute agreement). Discrepancies were resolved through discussion until a consensus was reached. After confirming acceptable agreement, the remaining videos were rated by HSN.
We used descriptive statistics to summarize the characteristics of the retrieved videos and calculate the PEMAT-A/V, GQS, and the mDISCERN index.
To examine attention on YouTube, we calculated each video's view rate—defined as the number of views divided by the number of days since upload (views/day). As the distribution of view rates was highly skewed, particularly among the pro-fluoride videos, which included some with exceptionally high view counts, we applied a log transformation to stabilize the variance and reduce the influence of extreme outliers. Specifically, we used the formula log(views/day +1) to avoid undefined values for zero and ensure that all values remained positive. Visual inspection also indicated that the variability (i.e., dispersion) in view rates was notably greater in the pro-fluoride group than in the anti-fluoride group. This heterogeneity makes it difficult to interpret group differences using raw values, further justifying the need for transformation. Statistical significance was defined a priori as p < 0.05 (two-tailed). No corrections for multiple comparisons were applied, as the analyses were exploratory. All analyses were performed in R version 4.1.1 (R Foundation for Statistical Computing). Descriptive statistics and group comparisons were conducted using base R functions. Inter-rater reliability was assessed using the ‘irr’ package (version 0.84.1) for Cohen's κ and Gwet's AC1, and the ‘psych’ package (version 2.1.9) for intraclass correlation coefficients. Spearman's rank correlations were computed using the base ‘stats’ package.
Results
3
Fig. 1 shows the PRISMA flow of video selection. Of the 84 eligible videos, 49 were pro-fluoride, 18 were anti-fluoride, and 17 were other. As Table 1 indicates, pro-fluoride videos were mainly uploaded by dentists/dental hygienists, while most anti-fluoride videos came from laypersons, with a smaller number from unknown sources or a single dental professional. Anti-fluoride videos also received more likes on average.Fig. 1PRISMA flow diagram depicting inclusion and exclusion criteria of YouTube videos.Fig. 1. Table 1Video characteristics.Table 1. ProAntiOtherBasic informationNumber of videos included491817Days since upload - maximum (days)264824374177Days since upload - minimum (days)157296563Days since upload - average (days)11939441465Number of views - maximum (views)2,475,818283,1224,111,810Number of views – minimum (views)9165816,731Number of views - average (views)151,21476,224588,462Views per day - maximum14253277303Views per day - minimum0.050.4516Views per day - average150111748Number of likes - maximum13,31943127671Number of likes - minimum0882Number of likes - average99215241781Video length - maximum (seconds)361611691247Video length - minimum (seconds)6616479Video length - average (seconds)605644541Video creatorsProAntiOtherDentists and dental hygienists29113Local government200Companies200Dental association100Layperson15154Unknown020Target audienceProAntiOtherGeneral public491817Dental hygienists000
As Table 2 shows, pro-fluoride videos scored significantly higher on reliability and overall quality (mDISCERN: 2.61 ± 0.95 versus 1.29 ± 0.57, d = 1.53, 95% CI 0.89–2.17; GQS: 2.46 ± 1.36 versus 1.00 ± 0.91, d = 1.22, 95% CI 0.60–1.84; both p < 0.001). By contrast, PEMAT-A/V scores for understandability (58.0% versus 52.8%) and actionability (60.2% versus 53.0%) did not differ significantly between groups. Fig. 2, which presents the distributions of PEMAT-A/V, mDISCERN, and GQS scores, illustrates group medians, interquartile ranges, and outlying values for pro- versus anti-fluoride videos. However, only about one-quarter of videos satisfied the PEMAT understandability threshold, and about one in 10 fulfilled the actionability threshold, underscoring limited understandability and actionability. Among pro-fluoride sources, videos from local government and the dental association scored highest on reliability, whereas among anti-fluoride sources, videos from unknown uploaders slightly outperformed those from laypersons.Table 2. Comparison of quality scores between pro- and anti-fluoride videos.⁎Table 2. Video FeaturesFluoride statusPro (n = 49)Anti (n = 18)p-valuePEMAT-A/VUnderstandability58.0 ± 16.952.8 ± 15.80.226Actionability60.2 ± 19.653.0 ± 20.00.332mDISCERN2.61 ± 0.951.29 ± 0.57<0.001⁎GQS2.46 ± 1.361.00 ± 0.91<0.001⁎GQS: Global Quality Score.Values are presented as mean ± standard deviation.The Mann-Whitney U test was used.⁎p < 0.001.Fig. 2. Distributions of scores by fluoride status. Boxplots show the distribution of (A) PEMAT-A/V Understandability, (B) PEMAT-A/V Actionability, (C) Global Quality Score (GQS), and (D) mDISCERN scores. Whiskers indicate the 1.5 interquartile range.Fig. 2
Table 3 presents correlations between content quality and engagement. Among pro-fluoride videos, no significant correlations were observed (e.g., GQS ρ = −0.24, p = 0.085). By contrast, among anti-fluoride videos, view rate showed a moderate to strong positive correlation with both understandability (ρ = 0.53, p = 0.028) and GQS (ρ = 0.53, p = 0.029), suggesting that clearer and better-structured anti-fluoride content tended to attract more viewers.Table 3. Spearmans rank correlation (ρ) between view rate and quality scores.Table 3. Pro (n = 49)Anti (n = 18)ρp-valueρp-valuePEMAT-A/V Understandability0.050.7300.530.028PEMAT-A/V Actionability−0.100.4700.410.100GQS−0.240.0850.530.029mDISCERN−0.150.2900.130.630
Appendix F presents the detailed PEMAT-A/V item-level results. Both pro- and anti-fluoride videos generally used everyday language and direct address effectively, but performed poorly on items requiring the use of tables, graphs, or diagrams. For example, in the pro-fluoride group, 87.0% of videos made their purpose completely evident from the beginning (Item 1) and 87.0% used common, everyday language (Item 3), whereas about 11% fulfilled the criterion for Item 18 (“use of tables with clear headings”). In the anti-fluoride group, 94.4% used everyday language (Item 3) and 94.4% had text on the screen that was easy to read (Item 12). Item 18 (“use of tables with clear headings”) was generally not applicable owing to the absence of tables (see Appendix F). These findings highlight the widespread lack of visual support across both groups.
Concerning inter-rater reliability, for PEMAT-A/V item ratings, Gwet's AC1 ranged from 0.48 to 0.83 (95% CIs in Appendix G); corresponding Cohen's κ values were lower, consistent with prevalence effects. At the scale level, ICCs were 0.76 for understandability and 0.51 for actionability. Appendix H reports agreement for the single-item GQS. Appendix I reports the item-level κ/AC1/ICC coefficients for mDISCERN; the global index showed moderate agreement (AC1 ≈ 0.46).
Discussion and conclusion
4
Discussion
4.1
This study evaluated 84 YouTube videos on fluoride use for caries prevention and compared pro- and anti-fluoride content using PEMAT-A/V, GQS, and mDISCERN. For PEMAT-A/V, mean understandability and actionability were slightly higher for pro-fluoride videos, but the differences were not significant. Within the pro group, videos posted by local governments and dental associations achieved particularly high mDISCERN scores, indicating stronger source credibility. By contrast, the quality of layperson-generated videos was more variable, suggesting that the creator's background influences clarity and actionability. Although prior work on webpages reported that anti-fluoride sites can be more readable than pro sites [19], direct comparison is limited because this study assessed audiovisual materials rather than text. The observed effect sizes for reliability and overall quality differences between pro- and anti-fluoride videos (Cohen's d = 1.53 for mDISCERN; d = 1.16 for GQS) represent large practical differences according to conventional interpretive benchmarks (d ≥ 0.8 indicating a large effect). These substantial effect sizes underscore that the reliability gap between expert-generated and lay-produced content is not merely statistically significant but also meaningfully large in magnitude, with important implications for public health messaging strategies.
On GQS, which reflects overall structure and coverage, and on mDISCERN, a reliability index, pro-fluoride videos significantly outperformed anti-fluoride videos, with large effect sizes indicating robust differences beyond statistical significance. Nevertheless, higher credibility and coherence did not automatically translate into greater reach. Among anti-fluoride videos, higher view counts were positively correlated with higher understandability and with higher GQS, implying that lower cognitive burden and smoother structure can drive engagement. These patterns align with the ELM: Evidentiary strength operates as a central cue, while audience engagement often relies on peripheral cues such as fluency, narrative style, and production choices. Interestingly, a notable characteristic within the pro-fluoride group was the use of vlog-style formats, particularly by laypersons. These videos, often featuring everyday clinic scenes or family routines, may not be primarily designed as expert information vehicles. However, such portrayals can shape attitudes through familiarity and empathy, potentially influencing behavior via the peripheral route [23]. This highlights a key tension: Even within the pro-fluoride camp, a divergence prevails between high-reliability content from experts and potentially more engaging, narrative-driven content from laypersons. As our findings suggest, these vlog-style formats—though effective for building empathy—may not always prioritize the high informational quality observed in expert-produced content, suggesting a potential strategy wherein health authorities could partner with trusted creators to produce expert-informed vlogs. However, such an approach necessitates careful design to leverage empathetic storytelling without trivializing the scientific information being conveyed. Even pro-fluoride videos fell short of recommended PEMAT-A/V thresholds: understandability, 56.4%; and actionability, 58.0%. Item-level patterns indicated limited use of concise tables and clear headings (PEMAT Item 18) and insufficient charts or graphs to prompt action (PEMAT Item 24). Overall, these findings underscore a persistent accuracy–engagement gap: Expert-made content tends to be more credible and coherent, yet content that is easier to process can spread more widely. Beyond the fluoride context, integrating content-quality metrics with platform analytics offers a replicable framework to study how accuracy, accessibility, and reach interact across digital health topics. Our findings align with reports from other medical domains. Studies on telemedicine and urologic immunotherapy have documented low understandability and especially low actionability on PEMAT-A/V, with only fair DISCERN and modest GQS scores, alongside persistent misinformation concerns [31,32]. Similar quality limitations have been observed for placenta accreta and PDE5 Inhibitor content, and prostate cancer mental health videos have yielded mixed quality despite covering high-need topics [32]. Collectively, these benchmarks suggest a recurring configuration in digital health: Expert-generated content generally achieves higher reliability, whereas materials that are easier to process—regardless of accuracy—typically gain greater reach. Our fluoride-related results align with this broader trend, reinforcing the need to pair evidence-rich central cues with fluency-oriented design features. Looking forward, our integrated framework, which combines PEMAT-A/V, GQS, and mDISCERN metrics with engagement analytics, may inform the development of algorithmic or semi-automated systems for scalable health content quality assessment. Future research could explore whether specific linguistic features, visual elements, or structural characteristics that predict high PEMAT-A/V or mDISCERN scores can be computationally detected through natural language processing or computer vision techniques. Such automated quality signals could potentially be incorporated into platform recommendation algorithms to promote evidence-based content more effectively, enabling real-time monitoring of health misinformation across large video corpora.
Exposure to online misinformation can cause parents to decline fluoride application for their children, and anti-fluoride beliefs have been associated with broader patterns of treatment refusal and vaccine hesitancy. These dynamics can elevate disease burden among children and communities, contribute to preventable suffering, and raise healthcare costs [22]. Addressing misinformation on widely used platforms is, therefore, a public health priority. Cultural and linguistic factors likely shape engagement with fluoride content in Japan. Population-based studies have indicated that comprehensive health literacy in Japan is lower than in Europe and heterogeneous across adults, underscoring the need to tailor materials for comprehension and action. Additionally, a validated Japanese version of the PEMAT demonstrates acceptable reliability and construct validity, supporting the use of understandability and actionability benchmarks in Japanese-language materials. Beyond measurement, randomized experiments manipulating processing fluency (e.g., typography, layout) have demonstrated that easier-to-read health materials increase perceived ease, self-efficacy, and behavioral intentions, suggesting that design choices directly enhance motivation to act. [10] Complementary evidence from systematic reviews has suggested that transparent visual aids reliably improve risk understanding and informed decision making across diverse audiences, including those with lower skills [11]. In sum, aligning credible, evidence-based messages with fluency-oriented design (captions, chaptering, concise graphics) is a plausible strategy to maximize both accessibility and persuasive impact in Japanese video health communication.
Dental caries has been described as a “canary in the coalmine” because of its close associations with socioeconomic determinants [9]. Individuals with lower health literacy are more likely to rely on potentially inaccurate sources of health information [8]. Ensuring that YouTube content about fluoride is accurate, comprehensible, and actionable is, thus, essential to mitigate disparities and reduce the disproportionate impact of oral disease on vulnerable populations.
Trusted sources—particularly local governments and dental associations—should continue to maintain high GQS and mDISCERN performance while systematically improving understandability and actionability. Design strategies that reduce cognitive load and guide concrete behaviors can bridge the accuracy–engagement divide. Examples include chaptered segments, concise infographics, and clear subtitles to reinforce narration, as well as step-by-step demonstrations of product selection, appropriate dosage, and use frequency to support behavior change [23]. Aligning central-route content (evidence and guidance) with peripheral-route supports (fluency, narrative structure, visual scaffolds) may increase both reach and sustained persuasion among audiences with diverse health literacy levels.
Limitations
4.1.1
This study had several limitations. First, video sampling was conducted based on specific search terms and search dates and, hence, did not represent the entirety of the fluoride-related content on YouTube. Sorting by total view count likely over-represented highly popular videos while under-representing less-viewed yet potentially influential content. Furthermore, our sample of 84 videos—though systematically selected—might not have fully captured the complete breadth of fluoride-related content on the platform. Second, because YouTube is dynamic, our findings should be interpreted as time-bounded; moreover, the interval between the initial search (October 15, 2023) and the focused review of top results (February 10, 2025) may have exposed the dataset to changes in ranking dynamics. This rapid evolution of content suggests that one-off analyses are insufficient; continuous monitoring of the information ecosystem and potential collaborations with platforms to promote reliable content may be necessary to mitigate the spread of misinformation. Third, we did not model YouTube's recommendation algorithms, which strongly influence visibility and engagement. Fourth, although standardized tools were used, rater subjectivity cannot be completely excluded. Inter-rater agreement for GQS (a single-item global rating) and for the mDISCERN index was moderate, which tempers the strength of inferences drawn from these scales; results were interpreted cautiously. Fifth, viewer characteristics (e.g., age, health literacy, prior attitudes) were not assessed, limiting generalizability across subgroups. Finally, analyses were restricted to Japanese-language videos and were observational in nature; causal inferences regarding how video features drive engagement cannot be drawn. Future research should involve more detailed qualitative analyses examining how pro- and anti-fluoride content influences viewer attitudes and behaviors.
Innovation
4.2
This is the first study to integrate the PEMAT-A/V, GQS, and mDISCERN with YouTube analytics for Japanese fluoride content. By integrating established quality metrics with platform data, it provides a data-driven framework to inform algorithm-aware oral-health communication strategies.
Conclusion
4.3
This study evaluated the quality of pro- and anti-fluoride YouTube videos on fluoride use, using the PEMAT-A/V, GQS, and mDISCERN to assess their understandability, actionability, natural flow and comprehensiveness, and reliability. Although pro-fluoride videos were rated highly for structure and reliability, they did not necessarily attract many views. By contrast, some anti-fluoride videos containing misinformation received more views when they scored well in terms of understandability and natural flow.
The dissemination of health information must prioritize not only scientific accuracy but also strategic communication approaches that convey evidence-based content in ways that are accessible and actionable for diverse audiences. To translate these findings into practice, public health authorities and professional dental organizations should invest in communication strategies that are not only evidence-based but also algorithm-sensitive. In short term, collaborations with digital platforms (e.g., YouTube) may help increase the visibility of reliable videos. In the long term, capacity building efforts, such as training health communication specialists, are needed to develop engaging, audience resonant storytelling while maintaining scientific accuracy. Such measures could enhance trustworthy oral health information's visibility and impact, thereby counteracting misinformation and supporting equitable disease prevention at the population level. Future work could also extend this approach to AI-assisted screening, for example using natural language processing or computer vision to detect features associated with higher quality content at scale. Ultimately, the integrated methodological framework employed herein—particularly the combination of validated quality metrics with platform analytics—offers a replicable model for future research to assess and enhance health communication across a broad range of topics on digital platforms.
CRediT authorship contribution statement
Hikari Sophia Nagao: Visualization, Investigation, Writing – review & editing, Writing – original draft. Tsuyoshi Okuhara: Validation, Supervision, Project administration. Emi Furukawa: Validation, Writing – review & editing. Hiroko Okada: Supervision, Writing – review & editing. Takahiro Kiuchi: Supervision.
Funding sources
This work was supported by the 10.13039/501100001691Japan Society for the Promotion of Science KAKENHI (25K13483).
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Madathil K.C.Rivera-Rodriguez A.J.Greenstein J.S.Gramopadhye A.K.Healthcare information on You Tube: a systematic review Health Informatics J 21320151731942467089910.1177/1460458213512220 · doi ↗ · pubmed ↗
- 2Ministry of Internal Affairs and Communications 2023 Survey on media usage time and information behavior: Ministry of internal affairs and communications Available from:https://www.soumu.go.jp/main_content/000953019.pdf 2023
- 3Maeoka M.Think with Google. Using You Tube as a marketing guide — trends emerge and multi-formats become part of everyday life: Makoto Maeoka Available from:https://www.thinkwithgoogle.com/intl/ja-jp/marketing-strategies/video/yt-trendsreport 2022/
- 4Morena N.Htite E.D.Ahisar Y.Hayman V.Rentschler C.A.Meguerditchian A.N.Breast cancer vlogs on You Tube: descriptive and content analyses JMIR Infodemiol 52025 e 6681210.2196/66812 PMC 1197511840163648 · doi ↗ · pubmed ↗
- 5Birch E.M.Leziak K.Jackson J.Dahl E.Niznik C.M.Yee L.M.Content quality of You Tube videos about gestational diabetes: systematic evaluation JMIR Diabetes 722022 e 3015610.2196/30156 PMC 903090935389355 · doi ↗ · pubmed ↗
- 6Işikman N.G.Coping with hard times through video narratives Int J Commun Res 1232022205210
- 7Moorhead S.A.Hazlett D.E.Harrison L.Carroll J.K.Irwin A.Hoving C.A new dimension of health care: systematic review of the uses, benefits, and limitations of social media for health communication J Med Internet Res 1542013 e 8510.2196/jmir.1933 PMC 363632623615206 · doi ↗ · pubmed ↗
- 8Chen L.Yang X.Nature and effectiveness of online social support for intercultural adaptation of mainland Chinese international students Int J Commun 9201521
