How Intense Is Effective? Exploring Aerobic Exercise Intensity for Knee Osteoarthritis Through a Bayesian NetworkMeta-Analysis
Luis Cabrera-Durán, Javier Palomares-Fernández, Ignacio Canitrot-González, Paride Crisafulli, Carlos Donato Cabrera-López, José Fierro-Marrero

TL;DR
This study examines the effectiveness of different aerobic exercise intensities for knee osteoarthritis but finds insufficient evidence to determine the best intensity.
Contribution
The novel contribution is a Bayesian network meta-analysis exploring aerobic exercise intensity effectiveness for knee osteoarthritis.
Findings
Comparisons between aerobic exercise intensities showed non-significant, imprecise results.
Evidence certainty was rated as very low due to high risk of bias and indirect comparisons.
Clinicians are advised to use a patient-centered approach and combine aerobic exercise with other interventions.
Abstract
Introduction: Knee osteoarthritis (KOA) is characterized by pain, stiffness, and functional limitation. Aerobic exercise (AE) is a key treatment with proven benefits. Aims: To determine the most effective AE intensity for KOA. Methods: Searches were performed in seven databases, including randomized controlled trials with AE-only groups. Outcomes assessed were pain, walking and sit-to-stand performance, stiffness, and disability. Methodological quality and bias were evaluated. Bayesian random-effects network meta-analyses compared AE intensities, reporting standardized mean differences (Hedges’ g) with 95% credible intervals. Certainty of evidence was rated using GRADE. Results: Fifteen studies were included (mean PEDro score: 5.93), showing “some concerns” or “high risk” of bias. Two meta-analyses (pain and walking performance) were conducted. Comparisons between AE intensities showed…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5- —“la Caixa” Foundation
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOsteoarthritis Treatment and Mechanisms · Total Knee Arthroplasty Outcomes · Lower Extremity Biomechanics and Pathologies
1. Introduction
According to the Osteoarthritis Research Society International, osteoarthritis (OA) is a disease of articular cartilage characterized by progressive degradation. OA first manifests as a molecular disorder (abnormal joint tissue metabolism) and later as an anatomical and/or physiological disorders, such as cartilage degradation, bone remodeling, osteophyte formation, joint inflammation, and loss of normal joint function [1]. Symptoms vary in intensity but usually worsen over time. The most common are pain, stiffness, and swelling. Disease severity can be classified radiologically using the Kellgren–Lawrence system [2].
Knee osteoarthritis (KOA) has a global prevalence of 22.9% of individuals over 40 years, diagnosed through radiological or symptomatic–radiological criteria [3]. The global incidence is 203 per 10,000 person-years among individuals aged 20 and above, higher for radiological than symptomatic diagnosis. Both prevalence and incidence increase with age and are higher in women [3]. KOA generates relevant healthcare costs, reaching an average annual cost of 2295 € per patient, rising with joint degeneration [4]. In the United States, costs can reach 10,000 € per patient, a difference potentially attributed to the higher obesity prevalence [5].
Current clinical practice guidelines recommend a multimodal approach in KOA, combining exercise, education, and weight control [6]. Notably, all six of the most recent clinical practice guidelines strongly recommend the use of exercise therapy regardless of age, pain, or osteoarthritis severity, with several guidelines recommending exercise as a first-line intervention [6]. Exercise therapy is considered safe, as different exercise modalities are not associated with an increased risk of adverse events [7].
Despite this strong recommendation, current guidelines do not provide information on how exercise therapy should be prescribed, particularly with respect to modality selection and dosage parameters [6]. Although the clinical benefits of exercise for KOA are well supported, key questions regarding optimal prescription remain unresolved [6].
Exercise is a cost-effective intervention, yet uncertainty persists regarding the most effective exercise modality and its optimal dosage. Recent research has examined a range of exercise modalities, including aerobic, resistance, mind–body, flexibility-based programs, and their combinations, through several time-points [7]. A generalized finding observed in this meta-analysis concerns that exercise modalities seem to modify pain intensity and functioning. However, results are imprecise for its effectiveness in quality of life, remaining uncertain. When checking for the specific effects of modalities, the majority seem to provide consistent effects on pain and functioning when interventions are provided for approximately 12 weeks, with uncertainty for its effectiveness in short and long interventions (aprox. 4 and 24 weeks respectively). However, AE seems to be a promising modality as we currently consider evidence to support its effectiveness with short-term interventions, being the only one to show consistent findings at this time point [7].
Among these modalities, AE has been highlighted as particularly promising due to its additional cardiovascular and anti-inflammatory benefits, which may be especially relevant in populations with KOA and frequent comorbidities [8]. Accordingly, several studies have begun to explore exercise dosage parameters using frameworks such as the FITT-VP model, examining factors such as exercise intensity, session duration, weekly frequency, intervention length, and supervision. For example, Juhl et al. [9] reported that supervised exercise was associated with improved outcomes in certain modalities, including AE, whereas no specific dosage parameters significantly influenced the effects of resistance exercise in populations with KOA, with and without hip OA.
Nevertheless, evidence regarding the role of exercise intensity remains inconsistent. A recent meta-analysis reported no clinically significant differences according to exercise intensity in either aerobic or resistance modalities [10], though findings were characterized by substantial heterogeneity and imprecision.
Importantly, no meta-analysis to date has specifically evaluated the clinical effectiveness of isolated AE across different intensity levels in isolated KOA populations. Given the widespread recommendation, safety profile, feasibility, and scalability of AE, clarifying whether exercise intensity influences its effectiveness represents a relevant and unresolved research question. The present study proposes to fill of this gap by conducting a network meta-analysis, where different categories of AE intensities will be compared directly and indirectly through a network meta-analysis.
To address this gap, the aim of the present study is to compare the effects of different AE intensities on clinical and functional outcomes exclusively in KOA populations. We conducted a Bayesian network meta-analysis, which allows the integration of both direct and indirect comparisons between multiple intensity categories.
2. Methods
This systematic review and network meta-analysis (NMA) followed the Preferred Reporting Items for Overviews of Systematic Reviews and Meta-analysis Network Meta-analysis Extension Statement [11]. The protocol was also registered in PROSPERO (CRD42024519632 on 12 March 2024).
2.1. Selection Criteria
Selection criteria were based on the PICOS strategy.
2.1.1. Population
Patients with KOA were eligible regardless of the diagnostic criteria. Studies combining KOA and other concomitant musculoskeletal pathologies, such as hip OA, were excluded.
2.1.2. Intervention
Eligible interventions should solely consist of AE alone. Studies combining exercise with any other intervention such as lifestyle recommendation or educational protocols were excluded. Studies should include internal load parameters of AE intensity, such as heart rate (HR), oxygen consumption measures, or subjective internal intensity parameters like perceived exertion or talk test.
2.1.3. Comparison
Any type of comparator was eligible for inclusion. Comparison groups would serve as connectors through the network.
2.1.4. Outcome Measures and Time-Point Analysis
The following outcome measures were considered for inclusion: (1) knee pain intensity; (2) performance in walking tasks; (3) performance in sit-to-stand tasks; (4) performance in combined sit-to-stand and walking tasks (timed-up-and-go or similar); (5) perceived knee stiffness; (6) disability related to KOA, exclusively analyzed with the total points in the Western Ontario McMaster Universities Osteoarthritis Index (WOMAC). Only immediate post-intervention analyses were eligible.
2.1.5. Data Availability and Study Design
Eligible studies should present sample size, central tendency, and dispersion parameters for the outcomes and time-point of interest. Only randomized controlled trials were eligible.
2.2. Search Strategy
Systematic searches were conducted using PubMed, EBSCO, Web of Science, SciELO, ScienceDirect, Scopus, and Google Scholar to find eligible studies across multiple databases. Search engines, search equations, databases, number of registries retrieved, and search dates are reported in the Supplementary Material. Additionally, non-systematic manual and citation searches were performed.
2.3. Selection Process
The selection process was conducted by two researchers (LCD and ICG) using Rayyan.ai [12]. Duplicates were identified automatically and manually removed. Reviewers independently and blindly screened records in two stages. For the first stage, Title–Abstract–Keywords–Design were analyzed. A pilot test with the first 60 records refined criteria: the first 30 were analyzed, discussed with the group, reanalyzed, and then reapplied the discussed criteria to records 31–60, which were additionally reanalyzed. The remaining records were screened, with final disagreements resolved by discussion of both reviewers or with a third reviewer when needed. In the second stage, records selected for full text analysis were reassessed. Final disagreements were resolved by discussion of both reviewers or with a third reviewer when needed.
Additional searching methods, such as manual and citation searches, were conducted collaboratively by the reviewers.
2.4. Data Extraction
2.4.1. Summary Information
The summary information would include authors, study design, KOA diagnostic criteria and the presence of pain. Additionally, the BMI, diabetes, and blood pressure values would be provided. Experimental (aerobic) and control groups (other interventions), allocated and analyzed sample size, age, and sex within groups would be provided.
Only the previous six outcome measures of interest would be extracted, along with their assessment tools. The results of the comparison between experimental groups or experimental with control groups for post-intervention analysis would be extracted. Summary results of this information were presented narratively and schematically indicating the direction of the effect (>, <, or ≈between groups).
2.4.2. Exercise Prescription
AE prescription parameters were extracted. These include the exercise activity (walking, cycling, etc.), and density features (intervallic or continuous modalities). In addition, volume parameters would be reported, including:
- Total session duration (including exercise and rest periods).
- Stimuli duration (specific time spent exercising).
- Weekly frequency.
- Number of weeks.
Additionally, the intensity of AE, monitorization instruments, and the intensity progression parameters would be reported. Exercise intensity parameters would be categorized into “near-maximal”, “vigorous”, “moderate”, “light”, “very light”, or any of their combinations (such as very-light-to-light, etc.) following the criteria by the American College of Sports Medicine [13] for percentage of maximum heart rate (%HRmax), percentage of maximum heart rate reserve (%HRR), percentage of maximal oxygen consumption (%VO_2_max), percentage of oxygen consumption reserve (%VO_2_reserve), and Borg 6–20 scale, see Supplementary Material. Estimations for other instruments such as the Borg CR-10 would be conducted based on the conversions proposed by Borg [14]. Talk test zones would be identified [15] to estimate HR or oxygen consumption measures, and consequently the intensity [16].
2.4.3. Data Extraction Procedure
Three researchers independently and blindly extracted the following information: (1) type of intervention of any group; (2) AE intensity prescription; (3) categorization of exercise intensity into “near-maximal”, “vigorous”, “moderate”, “light”, “very light”, or their combinations; (4) outcome measures of interest, and the instruments/tests employed; (5) sample size analyzed, central tendency and dispersion results. The agreed upon information was considered valid, included in the summary table, and used for meta-analyses. Disagreements would be resolved between reviewers. The remaining information was extracted collaboratively.
2.5. Methodological Quality and Risk of Bias
Two reviewers would blindly and independently assess the methodological quality using the PEDro scale, and the risk of bias using the Risk of Bias Tool 2.0 [17,18]. Agreed on items would be maintained, while disagreements were resolved by the group’s decision.
Inter-rater agreement was tested for each item within every study employing a quadratic weighted Cohen’s kappa coefficient (κ). The interpretation was “almost perfect” for 0.81–1.00; “substantial” if 0.61–0.80; “moderate” if 0.41–0.6; “fair” if 0.21–0.4; “slight” if 0.00–0.20; and “poor” when < 0.00 [19]. The function “cohen.kappa” was employed from the “psych” package version 2.3.12 [20] in R Software version 4.3.1 [21].
2.6. Network Meta-Analysis
A network meta-analysis (NMA) was conducted to compare the efficacy of AE across different intensity levels, with “no treatment” serving as the primary control node. Experimental nodes represented specific aerobic exercise intensities classified as near-maximal, vigorous, moderate, light, very light, or any combination of these, following previously established methodology.
Standardized mean differences were calculated using Hedges’ g [22] and interpreted as follows: very small (<0.20), small (0.20–0.49), medium (0.50–0.79), and large (≥0.80) [23]. Two Bayesian NMA models were implemented:
- Model 1: A Bayesian model including only the intervention effect (no covariates).
- Model 2: An adjusted Bayesian model including the intervention effect plus two study-level covariates: intervention duration (number of weeks) and weekly frequency.
The Bayesian hierarchical NMA was implemented using a custom Python script (version 3, see Supplementary Material) whose main class is based on the PyMC3 module [24].
The Bayesian models would produce three main outputs:
- Network graph indicating the direct connections established within each NMA.
- Forest plot comparing exercise interventions against the no treatment control.
- Pairwise comparison between interventions, reported as network (mixed), direct, and indirect estimates. Inconsistency between direct and indirect evidence was tested with a two-tailed z-test.
- Funnel plots to assess potential publication bias.
A detailed view of the meta-analysis methodology, Hedges’ g calculation, Bayesian models, and the plot generation processes can be found in the Supplementary Material.
2.7. Synthesis of Results
The synthesis of results will be presented with the adapted Cochrane GRADE (Grading of Recommendations Assessment, Development and Evaluation) of evidence for NMA [25]. This framework allows drawing conclusions on the certainty of the available evidence provided by the NMA, based on a stepwise evaluation of direct, indirect, and mixed evidence. The certainty is first analyzed respectively for direct and indirect evidence (most dominant loop), with both being tested for risk of bias, heterogeneity, indirectness and publication bias. Additionally, indirect evidence is tested for intransitivity. Intransitivity was assessed qualitatively following GRADE indications, checking for the presence of potential modifiers across direct comparison and indirect loops, such as different populations, interventions, or outcome measures. The final analysis from mixed evidence starts considering the certainty of the most dominant contribution of either direct or indirect evidence. Then, it is assessed for incoherence (z-test between direct and indirect estimates) and imprecision.
Each domain is judged as “not serious”, “serious”, or “very serious”, leading to downgrading the certainty by 0, 1 or 2 levels respectively. Limitations across domains result in an overall rating of the certainty of evidence as high, moderate, low or very low.
3. Results
3.1. Selection Process
A total of 6134 records were retrieved from the databases. After removing duplicates, 3216 records were screened for Title–Abstract–Keywords–Design. After resolving discrepancies, 207 records were analyzed in full text. A total of 15 studies [26,27,28,29,30,31,32,33,34,35,36,37,38,39,40] were included in the review, see Figure 1.
3.2. Summary Information
Diagnostic criteria varied across studies, with 12 studies selecting patients through both clinical and radiological criteria [26,27,28,30,31,32,35,36,37,38,39,40], only clinical [29], only radiological [34], or with an unstandardized procedure (diagnosed by a surgeon) [33].
The presence of pain was explicitly confirmed in nine studies [27,28,30,31,32,35,36,37,38]. In five other studies, the presence of pain was likely but not clearly stated (inferred from baseline pain assessments) [26,29,34,39,40]. One study did not provide clear information for the presence of pain [33].
A total of 455 patients were assigned to AE groups. Among them, 30 patients participated in a very-light-to-light intensity AE [40]; 20 were assigned to light-to-moderate intensity exercise [35]; 22 were assigned to light-to-moderate-to-vigorous exercise [30,31]; 31 participated in moderate intensity exercise [33,38]; 235 were assigned to a moderate-to-vigorous intensity exercise [27,32,34,36,37]; and 117 engaged in vigorous intensity exercise [26,28,29,33,35,39].
A total of 576 subjects were assigned to control groups. Among these, 259 subjects were allocated to resistance exercise programs [28,30,31,32,36,37,39]; 171 to education-based interventions [30,31,32,36]; 25 to a combined exercise program [34]; 24 to resistance exercise combined with behavioral and lifestyle modification strategies [34]; and 97 to no-treatment groups [27,37,38,39].
AE intensity was monitored using several procedures such as %HRmax [26,27,30,31,34,37,38,40], %HRR [32,35,36,39], the talk test [33], the Borg Rating of Perceived Exertion (6–20 points) [28,30,31,40], or the Borg CR10 scale [29].
The mean number of weeks for AE protocols was 16.9 ± 21.5 (range of 6–79), with a median of 10 (Q1 = 8, Q3 = 13), and a mode of 8 weeks. The weekly frequency of AE sessions was 2.93 ± 0.52 (2–4 range) with a median and mode of 3 sessions per week (Q1 = 3, Q3 = 3).
Twelve studies measured pain intensity [26,28,29,30,31,32,34,36,37,38,39,40], 10 explored walking performance [26,27,29,30,32,33,35,38,39,40], four analyzed sit-to-stand performance [30,33,35,39], three studies analyzed combined walking and sit-to-stand performance [27,33,40], and only five explored perceived knee stiffness [26,29,30,38,39] and disability associated with KOA [29,30,33,38,39]. See Supplementary Material for tabular information of each study included.
3.3. Methodological Quality and Risk of Bias
Of the 15 evaluated studies, the average score on the PEDro scale was 5.93 points (ranging from 4 to 8), with moderate-to-perfect reliability. Regarding the ROB 2.0 scale, for pain intensity, eight articles were rated as having “high risk” of bias [26,28,32,34,36,38,39,40], and three presented “some concerns” [29,30,37], extracted with slight-to-perfect reliability.
For walking performance, 10 studies presented “high risk” of bias [26,27,31,32,33,35,36,38,39,40], while one study presented “some concerns” [29], with slight-to-perfect reliability.
All studies evaluating sit-to-stand performance presented a “high risk” of bias [31,33,35,39], determined with slight-to-substantial reliability.
Similarly, all studies assessing combined sit-to-stand and walking performance were rated with “high risk” of bias [27,33,40], with a reliability ranging from fair to moderate.
For the perceived stiffness, three studies showed “high risk” of bias [26,31,38], while one presented “some concerns” [29], with a reliability ranging from fair to substantial.
Finally, regarding disability related to KOA, four studies were rated with a “high risk” of bias [31,33,34,38], while one study presented “some concerns” [29]. Inter-rater agreement ranged from fair to substantial. See Supplementary Material for detailed analysis of PEDro and ROB 2.0.
3.4. Network Meta-Analysis
Two meta-analyses were conducted: one focused on pain intensity, and the other on walking performance. Although six outcomes were initially considered, the remaining four could not be analyzed due to insufficient connectivity between studies in the network. Details on the eligibility process for conducting the meta-analyses, data availability, instrument selection, and the estimations into means and SD are presented in the Supplementary Material.
3.4.1. Pain Intensity
Seven studies were included in the meta-analysis [28,30,32,34,37,38,39]. Studies explored light-to-moderate-to-vigorous [30], moderate [38], moderate-to-vigorous [32,34,37] and vigorous intensities [28,39]. No direct comparisons between AE intensities were available. Control groups included educational protocols [30,32], resistance exercise [28,30,32,37,39], no treatment [38], and multimodal exercise (resistance, aerobic and stretching) interventions [34]; see Figure 2.
The two NMA models revealed non-significant and imprecise effects for all AE intensities, including light-to-moderate-to-vigorous (Model 1: g = −0.27; 95% CrI −3.19, 2.67; Model 2: g = −0.27; 95% CrI −3.19, 2.78), moderate (Model 1: g = −0.65; 95% CrI −4.09, 2.99; Model 2: g = −0.59; 95% CrI −7.07, 5.98), moderate-to-vigorous intensities (Model 1: g = −0.42; 95% CrI −2.49, 1.56; Model 2: g = −0.42; 95% CrI −2.48, 1.59), and vigorous (Model 1: g = 0; 95% CrI −2.19, 2.17; Model 2: g = 0; 95% CrI −2.17, 2.19) compared to “no treatment”. Heterogeneity was large in both models—Model 1 (τ = 1.40) and Model 2 (τ = 1.40). Model 1 forest plot is presented in the Supplementary Material; see Model 2 forest plot in Figure 3.
The addition of both covariates (weekly frequency and number of weeks) did not meaningfully adjust the precision of AE intensity estimates. In addition, neither weekly frequency (w_f_ = −0.15; 95% CrI −19.15, 18.28) nor number of treatment weeks (w_n_ = −0.09; 95% CrI −19.34, 19.30) showed a linear association with the observed effects.
Comparisons between different intensities of AE were based exclusively on indirect evidence. All estimated effects from both models were non-significant, with 95% CrI exceedingly wide in both directions, including large effect sizes (g > 0.7). This indicates a high degree of imprecision, limiting reliable conclusions about the true effect of AE intensities; see Table 1.
The preliminary GRADE assessment of the indirect network for both models identified “very serious” concerns regarding risk of bias, but “not serious” for indirectness. Heterogeneity was rated as “very serious”, whereas intransitivity was judged as “not serious”. Publication bias across the indirect network ranged from “not serious” to “very serious” depending on the comparison, see Supplementary Material with funnel plots. For the mixed estimates, imprecision was judged as “very serious” across all comparisons. Consequently, the final GRADE evaluation indicated an overall certainty of evidence of “very low” for all comparisons of AE intensities in both models, see Table 1.
3.4.2. Walking Performance
Seven studies were included in the meta-analysis [27,31,32,33,35,38,39]. Studies explored AE interventions of light-to-moderate [35], light-to-moderate-to-vigorous [31], moderate [33,38], moderate-to-vigorous [27,32], and vigorous intensities [33,35,39]. Two direct comparisons between intensities were available: light-to-moderate vs. vigorous [35]; and moderate vs. vigorous [33].
Control groups included resistance exercise [28,31,32,34,37,39], educational protocols [31,32,39], and no treatment [27,38]; see Figure 4.
The two NMA models revealed non-significant and imprecise effects for all AE intensities, including light-to-moderate intensity (Model 1: g = 0.78; 95% CrI −2.31, 3.88; Model 2: g = 0.33; 95% CrI −1.34, 1.96), light-to-moderate-to-vigorous intensity (Model 1: g = 1.22; 95% CrI −1.12, 3.87; Model 2: g = 0.25; 95% CrI −1.10, 1.62), moderate AE (Model 1: g = 0.89; 95% CrI −2.31, 2.94; Model 2: g = 0.06; 95% CrI −1.82, 1.91), moderate-to-vigorous intensity (Model 1: g = 1.82; 95% CrI −0.21, 4.21; Model 2: g = 0.45; 95% CrI −0.86, 1.72), and vigorous intensity (Model 1: g = 0.91; 95% CrI −0.63, 2.61; Model 2: g = 0.46; 95% CrI −0.42, 1.32), compared to “no treatment”. Heterogeneity was large in Model 1 (τ = 0.89) and decreased to small in Model 2 (τ = 0.30). Model 1 forest plot is presented in the Supplementary Material. See Model 2 forest plot in Figure 5.
The inclusion of both covariates (weekly frequency and number of weeks) slightly improved the precision of AE intensity estimates. Weekly frequency (w_f_ = 2.68; 95% CrI 0.68, 4.62) showed a significant positive effect on walking function, but the 95% CrI limits confidence in this result. The lower bound already suggests a large effect (0.68 per additional day), and the upper bound appears implausible. The number of weeks (w_n_ = −0.35; 95% CrI −0.8, 0.1) showed a non-significant and imprecise association with the observed effects.
Comparisons between AE intensities relied mainly on indirect evidence. Only light-to-moderate vs. vigorous had direct evidence [35], while moderate vs. vigorous presented both direct and indirect evidence [33].
Both models presented non-significant and imprecise results for pairwise comparisons, with 95% CrI, exceeding in both directions with large effect sizes (g > 0.7), preventing drawing clear conclusions about the real difference between AE intensities.
All estimated effects from both models were non-significant, with 95% CrI exceedingly wide in both directions, including large effect sizes (g > 0.7), preventing drawing clear conclusions about the effect of AE intensities; see Table 1.
A preliminary GRADE assessment of the direct networks identified “very serious” concerns for risk of bias, while heterogeneity and publication bias were rated as “not serious”, since only one study contributed to each comparison. For the indirect networks, risk of bias was judged as “very serious”, indirectness as “not serious”, and heterogeneity as “very serious” in Model 1; ranging from “not serious” to “very serious” in Model 2. Publication bias also ranged from “not serious” to “very serious”, while intransitivity was consistently judged as “not serious” in all comparisons; see Table 1 and Supplementary Material. For the mixed NMA estimates, imprecision was rated as “very serious” in both models. Final GRADE evaluation indicated “very low” certainty of evidence for all comparisons; see Table 1.
4. Discussion
This NMA aimed to identify the most effective AE intensity for KOA across six clinical and performance outcomes. While the analytical framework was designed to provide comparative estimates between AE intensities, the findings primarily reflect the current limitations of the available literature, rather than definitive effects of intensity.
Fifteen studies [26,27,28,29,30,31,32,33,34,35,36,37,38,39,40] were included in this review, but only two outcomes (pain intensity and walking performance) could be meta-analyzed. Neither analysis provided clear conclusions due to the wide CrI and a high risk of bias, mainly driven by: (1) low statistical power from few studies per intensity category; and (2) predominant reliance on indirect comparisons. Importantly, these results should not be interpreted as evidence of ineffectiveness, but rather as an indication that the existing literature does not yet allow reliable estimations of the real AE intensity effects. This limitation is driven not only by indirectness, but also by the high risk of bias observed across nearly all included trials, which fundamentally undermines the validity of any pooled estimate.
Given these findings, it remains highly relevant to examine AE in great detail and to assess whether exercise intensity is a determinant of its effectiveness. This relevance is supported by four key considerations: (1) recent meta-analytical evidence indicates that AE is effective, with precision and in the short term, (around 4 weeks) on pain and functional outcomes on KOA, whereas evidence for other exercise modalities remains uncertain at this time-point [7]; (2) the benefits of AE appear to be maintained at approximately 12 weeks of intervention, although presents uncertainty at longer intervention periods (around 24 weeks) [7]; (3) exercise therapy, and AE in particular, is consistently and strongly recommended by clinical guidelines in KOA [6]; (4) compared with other modalities, AE has high feasibility and scalability, making it easy to conduct increments in its intensity [41].
Consequently, distinguishing between close intensities (e.g., moderate vs. moderate-to-vigorous) remains a critical but unresolved issue. Pain outcomes may be particularly imprecise due to their multifactorial nature. Although performance outcomes might be expected to improve with intensity, higher intensities may also exacerbate symptoms by increasing risk flare-ups, as reported in some populations through the exposure to certain physical activities [42]. The scope to answer this question is limited by insufficient reporting of dropouts, and the scarce methods to mitigate attrition-related bias (e.g., data-imputation). These concerns were reflected in ROB Domain 2 with pain studies judged with “some concerns” or “high risk” of bias [28,32,34,38,39], as did all studies in walking performance. A major contributor to this assessment was the inherent lack of participant and personnel blinding in exercise-based interventions, which may have inflated perceived treatment effects, particularly in trials were subjects were compared to a usual care or no treatment condition.
To our knowledge, this is the first NMA comparing AE intensities in KOA. A previous meta-analysis including strength training found high-intensity exercise to be superior to low-intensity in pain and function, though not clinically relevant [10]. Other studies comparing AE and resistance exercise found no clear superiority [43].
Although AE alone has been considered effective for KOA, this conclusion should be interpreted with caution and extended to other exercise modalities. Limitations consistently observed across most systematic reviews include: (1) including patients not with KOA exclusively (most frequently with hip OA) [10,44,45,46]; (2) no standardized AE definitions [9,44,45]; (3) frequent combination with other interventions (physiotherapy, lifestyle recommendations, education, etc.) [9,44,45]; (4) controversial risk of bias assessment [45]; (5) low-certainty evidence by GRADE [10]; and (6) large confidence intervals leading to imprecision [9]. This review applied strict selection criteria, but few studies were identified, questioning whether AE still provides high-quality evidence as previously claimed [47].
Mechanistically, AE has multiple plausible effects on KOA. It may modulate central pain pathways by enhancing serotonin release, neuroplasticity via BDNF, and other neurotrophic agents [48,49]. It may also reduce systemic and articular inflammation, lowering cytokines implicated in KOA [50,51]. Preclinical KOA models show AE decreases IL-1β, caspase-3, and MMP-13, preventing cartilage breakdown and possibly slowing disease progression [52,53,54]. Functionally, AE can improve gait, endurance, and mobility, which may occur independently of pain changes [44]. Psychologically, AE may reduce stress, anxiety, and sleep disturbance, indirectly improving their clinical state [55,56].
Together, these findings support AE as a promising intervention for KOA. Despite uncertainty about its isolated effectiveness and optimal intensity, current evidence suggests clear potential benefits.
4.1. Limitations
This study presents several limitations. Firstly, few studies met the inclusion criteria, mainly due to population characteristics, insufficient detail in AE prescription, and scarcity of trials assessing isolated AE. The small number of studies reduced statistical power to detect differences between AE intensities. Secondly, there is a lack of studies directly comparing AE intensities; therefore, most evidence proceeded from indirect evidence. Thirdly, many studies presented high risk of bias.
4.2. Clinical Implications
AE seems to be a promising intervention for KOA; however, evidence derived from studies assessing isolated AE interventions in KOA remains limited. Current results, in addition to the support of other meta-analyses, suggest potential benefits, though it remains uncertain whether AE intensity significantly influences pain intensity or walking performance. In this context, clinical decisions should follow a pragmatic approach. Specifically, clinicians may consider the following points when prescribing AE in KOA: (1) assess the patient’s history on their clinical response to physical activity, including pain exacerbation and joint swelling; (2) based on this assessment, consider initiating AE using modalities with reduced articular loading (e.g., aquatic exercise, cycling, partial body-weight-supported exercises); (3) begin with moderate-intensity continuous exercise, and progress according to patient’s preferences, tolerance, and response to moderate-to-vigorous continuous or intervallic exercise; (4) preferably monitor AE intensity using direct physiological measures, with %HRR recommended due to its bond with resting heart rate; when direct measures are not feasible, simple and validated tools such as the talk test, Borg CR10, or Borg 6–20 scales bay be used; (5) as most interventions employ a frequency of three sessions per week, this frequency is likely to provide the greatest certainty of benefit; (6) based on evidence from existing meta-analyses, improvements in pain and physical function may be expected after approximately four weeks of intervention, with greater certainty around twelve weeks of exercise [7]; (7) AE should ideally be integrated within a multimodal management strategy, including medication control, physiotherapy, health education, and weight management.
While the optimal target intensity of AE remains undefined, this structured approach should allow clinicians to balance safety and potential benefit in the face of current evidence uncertainty. Medium-to-long-term AE programs may also provide added value by addressing common comorbidities such as overweight, hypertension, insulin resistance, and metabolic syndrome.
5. Conclusions
Very low certainty of evidence limits drawing precise conclusions from the effects of AE intensity on pain severity and walking performance in knee osteoarthritis. The available comparisons were characterized by concerning risk of bias, high heterogeneity, certain publication bias, and a marked imprecision of credible intervals. Overall, these findings indicate not an absence of effect, but a lack of robust primary evidence to draw precise conclusions. The present network meta-analysis therefore highlights an urgent need for high-quality, directly compared intensity categories, and adequate sample sizes. Such studies are essential to determine whether aerobic exercise intensity meaningfully influences clinical outcomes in KOA.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Kraus V.B. Blanco F.J. Englund M. Karsdal M.A. Lohmander L.S. Call for Standardized Definitions of Osteoarthritis and Risk Stratification for Clinical Trials and Clinical Use Osteoarthr. Cartil./OARS Osteoarthr. Res. Soc.201523123310.1016/j.joca.2015.03.036PMC 451663525865392 · doi ↗ · pubmed ↗
- 2Kellgren J.H. Lawrence J.S. Radiological Assessment of Osteo-Arthrosis Ann. Rheum. Dis.19571649450210.1136/ard.16.4.49413498604 PMC 1006995 · doi ↗ · pubmed ↗
- 3Cui A. Li H. Wang D. Zhong J. Chen Y. Lu H. Global, Regional Prevalence, Incidence and Risk Factors of Knee Osteoarthritis in Population-Based Studies E Clinical Medicine 20202930, 10058710.1016/j.eclinm.2020.100587 PMC 770442034505846 · doi ↗ · pubmed ↗
- 4Salmon J.H. Rat A.C. Achit H. Ngueyon-Sime W. Gard C. Guillemin F. Jolly D. Fautrel B. Health Resource Use and Costs of Symptomatic Knee and/or Hip Osteoarthritis Osteoarthr. Cartil.2019271011101710.1016/j.joca.2019.03.00530922982 · doi ↗ · pubmed ↗
- 5Salmon J.H. Rat A.C. Sellam J. Michel M. Eschard J.P. Guillemin F. Jolly D. Fautrel B. Economic Impact of Lower-Limb Osteoarthritis Worldwide: A Systematic Review of Cost-of-Illness Studies Osteoarthr. Cartil.2016241500150810.1016/j.joca.2016.03.01227034093 · doi ↗ · pubmed ↗
- 6Gibbs A.J. Gray B. Wallis J.A. Taylor N.F. Kemp J.L. Hunter D.J. Barton C.J. Recommendations for the Management of Hip and Knee Osteoarthritis: A Systematic Review of Clinical Practice Guidelines Osteoarthr. Cartil.2023311280129210.1016/j.joca.2023.05.01537394226 · doi ↗ · pubmed ↗
- 7Yan L. Li D. Xing D. Fan Z. Du G. Jiu J. Li X. Estill J. Wang Q. Belal A.A. Comparative Efficacy and Safety of Exercise Modalities in Knee Osteoarthritis: Systematic Review and Network Meta-Analysis BMJ 2025391 e 08524210.1136/bmj-2025-08524241093618 PMC 12522397 · doi ↗ · pubmed ↗
- 8Zheng G. Qiu P. Xia R. Lin H. Ye B. Tao J. Chen L. Effect of Aerobic Exercise on Inflammatory Markers in Healthy Middle-Aged and Older Adults: A Systematic Review and Meta-Analysis of Randomized Controlled Trials Front. Aging Neurosci.2019119810.3389/fnagi.2019.0009831080412 PMC 6497785 · doi ↗ · pubmed ↗
